
pr^T 1 WORLD INTELLECTUAL PROPERTY ORGANIZATION 

A International Bureau 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCI ) 



(51) International Patent Classification 6 : 

C12N 15/12, 15/62, SAO, 5/20, C07K 
14/705, 16/18, G01N 33/68, 33/50, A01K 
67/027 



Al 



(11) International Publication Number: WO 98746749 

(43) International Publication Date: 22 October 1998 (22.10.98) 



(21) International Application Number: PCT/US98/07283 

(22) International Filing Date: 10 April 1998 (10.04.98) 



(30) Priority Data: 

08/838,845 
09/058,389 



11 April 1997 (11.04.97) 
9 April 1998 (09.04.98) 



US 
US 



(71) Applicant: ST. J UDE CHILDREN'S RESEARCH HOSPITAL 

[US/US]; 332 North Lauderdale Street, Memphis, TN 
38105-2794 (US). 

(72) Inventors: BELT, Judith, A.; 817 Pecan Gardens Circle, West, 

Memphis, TN 38122 (US). CRAWFORD, Charles, R.; 1040 
Cabana Circle, East #7, Memphis, TN 38107 (US). PATEL, 
Divyen, H.; 1861 Jefferson Avenue, Memphis, TN 38104 
(US). 

(74) Agents: DAVIS, Michael, D. et al.; Klauber & Jackson, 41 1 
Hackensack Avenue, Hackensack, NJ 07601 (US). 



(81) Designated States: AU, CA, JP, European patent (AT, BE, 
CH, CY, DE, DK, ES, FL FR, GB, GR, IE, IT, LU, MC, 
NL, PT, SE). 



Published 

With international search report. 

Before the expiration of the lime limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: A NITROBENZYLMERCAPTOPURLNERIBOSIDE (NBMPR)-INSENSITIVE, EQUILIBRATTVE. NUCLEOSIDE TRANS- 
PORT PROTEIN, NUCLEIC ACIDS ENCODING THE SAME AND METHODS OF USE 



uSH< : 1. 1 in mji jj i . j .u jm ■■ ■ ; »* 




A 



(57) Abstract 

An isolated NBMPR-insensitive equtlibrative nucleoside transport protein (iENTP) and the nucleic acid encoding it is disclosed. The 
iENTP can be used in screening assays to identify both natural nucloside permeants and/or inhibitors and analogs thereof. In addition, 
transfected or transduced cell lines are disclosed which use the iENTP as the sole nucleoside transport protein. Methods of employing 
such cell lines for drug screening are also included. Furthermore methods of using hematopoietic stem cells transduced with an iENTP in 
a chemotherapy protocol is also described. In addition, methods of using these cells to selectively express a heterologous gene for gene 
therapy is disclosed. 
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A NTTROBENZYLMERCAPTOPUWNERIBOSIDE (NBMPRHNSENSITIVE, 
EQUILIBRATIVE, NUCLEOSIDE TRANSPORT PROTEIN, NUCLEIC ACIDS 
ENCODING THE SAME AND METHODS OF USE 

RESEARCH SUPPORT 

5 The research leading to the present invention was supported in part by R01 -CA55056 and 
Cancer Center CORE Support grant P30-CA21765 from the National Cancer Institute. The 
government may have certain rights in the present invention. Support for this invention was 
also provided by the AMERICAN LEBANESE SYRIAN ASSOCIATED CHARITIES. 

FIELP OF W INVENTION 

10 The invention relates generally to the equilibrative transport of nucleosides into cells, and 
more particularly to nitrobenzylmercaptopurineriboside (NBMPR)-insensitive. equilibrative, 
nucleoside transport proteins (/ENTPs), to nucleic acids which encode the proteins, methods 
of use of the proteins and nucleic acids, and antibodies to the proteins. 

BACKGROUND OF THE INVENTION 

15 Aside from being potential precursors to the building blocks of nucleic acids, the natural 

nucleosides are important metabolites having many physiological effects in assorted organs. 
For example, adenosine and its corresponding nucleotides are local signaling molecules that 
act through purinergic receptors to affect such varied physiological functions as lipolysis, 
neurotransmitter release, coronary vasodilation, cardiac contractility, renal vasoconstriction, 

20 and bronchial constriction: and thus extracellular adenosine concentrations can have 
significant effects on cardiac and vascular functions as well as play a role in 
neuromodulation [reviewed by Griffith et al Biochim. Biophys. Acta Rev. Biomembr ^ 
1286: 1 53- 181(1 996); Cass, in Drug Transport in Antimicrobial Therapy and Anticancer 
Therapy (N.H. Georgopapadakou, ed.(Marcel Dekker)), 403-451 (1995)]. In addition, 

25 nucleoside analogs are presently employed as anti-retro viral drugs, and as anticancer drugs. 
Although some extracellular nucleosides can passively permeate the plasma membrane, most 
participate in some form of protein mediated transport performed by nucleoside transport 
proteins. Nucleoside transport proteins play an important role in the uptake and efflux of 
physiological nucleosides used in DNA and RNA synthesis, lipid and glycogen metabolism, 

30 and glycoprotein and glycolipid synthesis. Furthermore nucleoside transport proteins 

mediate the uptake and efflux of a number of antitumor and antiviral nucleoside analogs in 
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cells [Cass, 1995, supra]. Nucleoside transport inhibitors are currently being investigated as 
modulators of adenosine action in cerebral and cardiac ischemia to provide protection from 
reperfusion injury [Rongen et aL, J. Clin. Invest. 95:658-668 (1995); Parkinson el al, Gen, 
Pharmacol 25: 1 053- 1058 (1 994)]. 

5 The first nucleoside transporters studied functioned as facilitated diffusion systems. Such 
equilibrative nucleoside transport proteins were initially classified solely by their sensitivity 
to nitrobenzylmercaptopurineriboside (NBMPR). As the study of these proteins progressed, 
additional characteristics such as permeant selectivity and tissue distribution have been used 
to further distinguish these proteins [Griffith and Jarvis, Biochim. Biophys. Ada Rev. 
10 Biomembr., 1286:153-181 (1996)]. More recently, sodium -depend ent con centrative 
nucleoside transport proteins have also been identified. N 

At least five distinct nucleoside transport activities have been identified that differ in their 
permeant selectivity, sensitivity to inhibitors and distribution in normal tissues and tumors 
[Griffith and Jarvis, 1996, supra]. Two of these activities exhibit equilibrative mechanisms 
1 5 that mediate both the influx and efflux of nucleosides across the plasma membrane, while the 
other three activities exhibit concentrative, sodium -dependent mechanisms that under 
physiological conditions mediate only the influx of nucleosides. 

The major equilibrative carrier in most cells, es (equilibrative, sensitive) is highly sensitive to 
the inhibitor NBMPR, having IC 50 values of 0.1 to 1 nM. A human homolog of this protein 
20 (hENTl ) has recently been cloned (Griffiths et al y Nature Med. 3:89-93 (1997). [t has 10 to 
1 1 predicted membrane spanning regions and has some structural similarities to the 
equilibrative glucose carriers. It does not however, share sequence homology with the 
glucose transporter family and appears to represent a new family of membrane transport 
proteins designated ENT for equilibrative nucleoside transporter. 

25 Many cells also contain a second equilibrative transporter ei (equilibrative, insensitive) that 
is insensitive to nanomolar concentrations of NBMPR, but can be inhibited by higher (u.M) 
concentrations [Belt, Mol Pharmacol, 24:479-484 (1983); Plagemann and Wohlheuter, 
Biochim. Biophys. Acta, 773:39-52 (1984)]. This protein, an NBMPR-insensitive 
equilibrative nucleoside transport protein (z'ENTP) has remained elusive. Both of the 

30 equilibrative transporters accept a broad range of physiological nucleosides and their 
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cytotoxic and antiviral analogs as permeants, although there appear to be differences in their 
affinity for some nucleosides [Griffith and Jarvis, 1996, supra]. 

/ENTPs also are present in most tumor cells, although the level of fENTP appears to be 
variable. The concentration of /ENTP in a particular tumor cell is likely to be a major 
5 determinant in the ability of that cell to grow following the administration of an es transport 
inhibitor to block the nucleoside salvage pathway, together with an inhibitor of de novo 
nucleoside synthesis, such as trimetrexatc, methotrexate, and tomudex. The level of /ENTP 
in a tumor cell is also likely to be a determinant of the success of using es inhibitors to block 
the efflux of cytotoxic and antiviral nucleoside analogs from cells. Under such 
1 0 circumstances, cells with higher concentrations of /ENTP will have a higher efflux of 
cytotoxic and antiviral nucleoside analogs, unless an inhibitor of the iENTP is also 
administered. 

NBMPR and its congeners are the most specific and potent inhibitors of the es transporter 
currently available. The es transporter has a high-affinity binding site for NBMPR that 

1 5 overlaps at least in part with the substrate binding site [Jarvis, in Adenosine Receptors, 

D.M.F. Cooper and C. Londos, eds., (New York: Alan R. Liss, Inc.), pp. 1 13-123 (1988)]. 
NBMPR binds to this site with a dissociation constant of 0. 1 to 1 nM and completely inhibits 
nucleoside uptake via es at concentrations in the nanomolar range [Paterson and Cass, in 
Membrane Transport of Antineoplastic Agents, ID. Goldman, ed. y (New York: Pcrgamon 

20 Press), pp. 309-329 (1986); Gati and Paterson, in The red cell membrane: structure, function, 
and clinical implications, P. Agre andJ.C. Parker, eds., (New York: Marcel Decker), pp. 
635-661 (1989); Jarvis, J 988, supra\ Plagemann et al, Biochim. Biophys. Acta., 969:1-8 
(1988)]. At high concentrations (>1 uM), however, NBMPR also inhibits the ei transporter 
[Paterson et al, Mol. Parmacoi, 18:40-44 (1980); Belt, Mol Pharmacol, 24:479-484 

25 (1983); Plagemann and Wohlheuter, Biochim. Biophys. Acta. t 773:39-52 (1984)]. 

Dipyridamole also binds to the NBMPR-binding site of es [Jarvis, Mol. Pharmacol, 30:659- 
665 (1986)], but is a less potent inhibitor of es than NBMPR [Plagemann and Wohlheuter, 
Curr. Topics Membr. Trans., 14:225-330 (1980); Paterson and Cass, 1986, supra; Plagemann 
and Woffedin, Biochim. Biophys. Acta., 969:1-8 (1988)]. 
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Dipyridamole also inhibits the ei transporter, but its potency against this transporter has been 
unclear. It has been suggested that the es transporter and the ei transporter are equally 
sensitive to dipyridamole since the curves for inhibition of nucleoside transport are 
monophasic in cells that possess both transporters [Jarvis, 1988, supra; Plagemann et al, 
5 1988, supra]. However, recent studies with Ehrlich ascites tumor cells in which the es 
transporter was blocked by addition of low concentrations of NBMPR, suggest that the ei 
transporter is significantly less sensitive to dipyridamole than es [Hammond, J. Pharmacol 
Exp. Ther, t 259:799-807 (1991)]. 



In addition to the two cquilibrative nucleoside transporters there are at least three Na + - 

1 0 dependent, concentrative nucleoside transport activities that differ from each other, and from 
the equilibrative transporters, in their substrate specificity. Two of these, cif and cit (also 
called Nl and N2), exhibit selectivity for purine and pyrimidine nucleosides respectively 
[Vijayalakshmi et ah, J. Biol Chem., 263:19419-19423 (1988) and Williams et al, Biochem. 
J., 264:223-23 1 (1 991 )]; while the third, cib (also called N3), has a broader selectivity 

15 accepting both purine and pyrimidine nucleosides [Wu et al,J. Biol Chem., 267:8813-8818 
(1992); Huang et al , J. Biol Chem., 268:20613-20620 (1993)]. All three of the 
concentrative nucleoside transporters are insensitive to NBMPR and dipyridamole at 
concentrations up to 10 yM; and under physiological conditions mediate only the influx of 
nucleoside into cells. These concentrative transport activities have been observed 

20 predominantly in normal tissues such as kidney [Le Hir and Dubach et al, Pflugers Arch., 
401:58-63 (1984); Williams et al, Biochem. J., 264:223-23 3 (1989); Williams et al, 
Biochem. J., 274:27-33 (1991); Le Hir etal, Pflugers Arch., 401:58-63 (1990)] and intestine 
Schwenk et al, Biochim. Biophys. Acta., 805:370-374 (1984); Vijayalakshmi etal, J. Biol 
Chem., 263:19419-19423 (1988); Williams et al, Biochem. ./., 274:27-33 (1991), and appear 

25 to be the major nucleoside transport activity in the specialized epithelial cells of these tissues 
[Williams et al, Biochem. J., 274:27-33 (1 989); Vijayalakshmi et al, J. Biol Chem., 
263:19419-19423 (1 988)]. However, low levels of Na + -dependent nucleoside transport have 
been observed in some tumor cells lines (Lee et al, Biochem. J., 274:85-90 (1991); Belt et 
al, Mol Pharmacol, 24:479-484 (1 993); Crawford et al, J. Biol Chem., 265: 1 3730-1 3734 

30 (1990b); Dagnino et al, Cancer Res., 50:6549-6553 (1990)]. 

cDNA clones have recently been obtained for two of the concentrative nucleoside 
transporters. Cass and co-workers have cloned rCNTl from rat intestine. This cDNA 
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encodes a 71 Kd protein with cit-type transport activity in transient expression studies in 
Xenopus oocytes [Huang et aL, J. Biol. Chem., 269:1 7757-17760 (1994)] and COS cells 
[Fang et a/., Biochem. J., 3 1 7:457-465 (1996)]. The second transporter. rSPNT (rCNT2) was 
cloned from rat liver and encodes a 72 Kd protein that has cif-type transport activity in 
5 expression studies in Xenopus oocytes. The CNT1 and SPNT transporters are 64% identical 
in their deduced amino acid sequences, and have significant homology with the bacterial 
nupC nucleoside transporters. They do not, however, have significant homology with any 
known mammalian proteins, and thus represent a new family of mammalian membrane 
transporters. It should be noted that rCNTl and rSPNT do not share homology with SNST 
10 [Pajor et aL, J. Biol Chenh, 267:3557-3560 (1992)], a member of the sodium-dependent 
glucose transporter family that has weak nucleoside transport activity when expressed in 
Xenopus oocytes. It is not yet known whether SNST represents a significant nucleoside 
transport activity in mammalian cells. The human homolog of CNT1 has recently been 
cloned [Ritzel et aL Ant. J. Physiol. (1997)]. 

1 5 The isolation and cloning of nucleoside transport proteins allows the biochemical 

characteristics of these transport proteins to be individually investigated and exploited. Such 
analysis is important for drug development for example, in which drugs can be more readily 
designed to inhibit specific transport mechanisms. Unfortunately, heretofore, no NBMPR- 
insensitive equilibrative transport protein has been isolated or cloned, which has severely 

20 hampered analogous studies with this major class of nucleoside transporters . 

The citation of any reference herein should not be deemed as an admission that such 
reference is available as prior art to the instant invention. 



SUMMARY OF THE TNVRNTT ON 



Nucleosides play a central role in cellular metabolism. The nucleoside salvage pathway is an 
25 important means employed by cells to maintain the requisite amount of these important 

metabolites. The initial step in the nucleoside salvage pathway is their transport across the 
plasma membrane. The key mode of transport of nucleosides into the cell is performed by 
nucleoside transport proteins contained in the plasma membranes. The present disclosure 
reports the first isolation and cloning of a cDNA encoding an NBMPR-insensitive 
30 equilibrative nucleoside transporter. 
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The present invention provides a purified transmembrane protein with nucleoside transport 
activity and the active fragments thereof. The transmembrane protein transports nucleosides 
across the plasma membrane through a facilitated diffusion process. More specifically, the 
transmembrane protein is an equilibrative nucleoside transport protein which is insensitive to 
5 nitrobenzylmercaptopurineriboside (NBMPR). In one embodiment the NBMPR insensitive, 
equilibrative nucleoside transport protein (/ENTP) contains approximately 450 amino acid 
residues, and 8 to 12 putative transmembrane domains. In one such embodiment the /ENTP 
is a vertebrate protein, in a preferred embodiment the /ENTP is a mammalian protein. In a 
more preferred embodiment the /ENTP is a human protein containing 456 amino acids and 
10 has 10 to 11 putative transmembrane domains. 



One aspect of the present invention provides an isolated nucleic acid which encodes an 
/ENTP of the present invention that includes exons and introns as shown in Figure 6. In a 
preferred embodiment of this type, the isolated nucleic acid contains the nucleotide 
sequences of SEQ ID NO:5 and SEQ ID NO: 10. The introns of the gene are individually part 

15 of the present invention, having nucleotide sequences of SEQ IDNOs:! 1, 12, 13, 14, 15, 16, 
17, 18, and 19 for introns 1-9 respectively. The 5* portion of intron 10 has the nucletodie 
sequence of SEQ ID NO:20 whereas the 3' portion of intron 10 has the nucleotide sequence 
of SEQ ID NO:21 . Intron 1 1 has the nucleotide sequence of SEQ ID NO:22. Nucleic acid 
probes which hybridize to the isolated nucleic acid are also included in the present invention. 

20 In a preferred embodiment of this type, the nucleic acid probes hybridize to the untranslated 
portion of the nucleic acid. 

The present invention further provides an isolated nucleic acid that contains a nucleotide 
sequence of the genomic 5' flanking region of a gene encoding an /ENTP. In a preferred 
embodiment of this type, the isolated nucleic acid has the nucleotide sequence of SEQ ID 
25 NO:6. The present invention also includes nucleic acid probes which hybridize to the nucleic 
acid sequence of SEQ ID NO:6. 

Another aspect of the present invention includes isolated nucleic acids encoding the /ENTPs 
and active fragments thereof. One such isolated nucleic acid encodes an amino acid 
sequence of a transmembrane protein that functions as an equilibrative nucleoside transport 
30 protein that is insensitive to NBMPR. In a particular embodiment the nucleic acid encodes 
an /ENTP that contains approximately 450 amino acid residues. In one embodiment of this 
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type, the isolated nucleic acid has a nucleotide sequence with at least 80% similarity With the 
coding sequence of the human /ENTP (hENT2), SEQ ID NO; 1 . In another embodiment of 
this type, the isolated nucleic acid has a nucleotide sequence with at least 80% identity with 
the coding sequence of the human /ENTP (hENT2), SEQ ID NO: 1 . In still another 
.5 embodiment the isolated nucleic acid has the nucleotide sequence of nucleotides 238-1605 of 
SEQ ID NO: 1 . In yet another embodiment of this aspect of the invention, an isolated nucleic 
acid encodes an /ENTP having the amino acid sequence of hENT2, SEQ ID NO:2. In a 
related embodiment an isolated nucleic acid encodes SEQ ID NO:2 comprising one or more 
conservative substitutions thereof. 

10 The /ENTPs of the present invention, as well as the corresponding nucleic acids which 

encode them can be obtained from any natural source preferably from a vertebrate cell, more 
preferably from a mammalian cell, and most preferably from a human cell. 

The present invention also includes oligonucleotides that hybridize to the nucleic acids 
encoding the /ENTPs of the present invention. In one embodiment the oligonucleotide 
1 5 consists of at least 1 8 nucleotides. In a preferred embodiment, the oligonucleotide consists 
of at least 27 nucleotides. In a more preferred embodiment, the oligonucleotide consists of at 
least 36 nucleotides. Oligonucleotides of the present invention can be used as nucleic acid 
probes, PCR primers, antisense nucleic acids, and the like, including for diagnostic and 
therapeutic purposes. 

20 In one such embodiment the oligonucleotide hybridizes to SEQ ID NO: 1 , or more 

particularly hybridizes to the coding sequence of SEQ ID NO: 1 . In a related embodiment the 
oligonucleotide hybridizes to the nucleotides 5 12-579 of SEQ ID NO: 1 . In one embodiment, 
the hybridization is performed under moderate stringency. In another embodiment, the 
hybridization is performed under standard hybridization conditions. In yet a third 

25 embodiment, the hybridization is performed under stringent hybridization conditions. 

Isolated DNAs that encode the iENTPs of the present invention and active fragments thereof 
are also part of the present invention. In one embodiment, the nucleotide sequence of the 
DNA has at least 80% similarity with the coding sequence of SEQ ID NO: 1 . In another 
embodiment, the nucleotide sequence of the DNA has at least 80% identity with the coding 
30 sequence of SEQ ID NO: 1 . In still another embodiment the DNA has the nucleotide 
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sequence of nucleotides 238-1605 of SEQ lDNO:l . In yet another embodiment the DNA 
encodes an /ENTP having the amino acid sequence of SEQ IDNO:2. In a related 
embodiment the DNA encodes an amino acid sequence of SEQ ID NO:2 comprising one or 
more conservative substitutions thereof in a particular embodiment the DNA is a 
5 recombinant DNA (cDNA). 



In another embodiment, an isolated or recombinant nucleic acid (including a DNA) has at 
least 80% similarity with the coding sequence of SEQ ID NO:7. In another embodiment, the 
nucleotide sequence of the nucleic acid has at least 80% identity with the coding sequence of 
SEQ ID NO:7. In still another embodiment the nucleic acid contains the nucleotide sequence 
10 of SEQ ID NO:7. In yet another embodiment the nucleic acid encodes a protein containing 
the amino acid sequence of SEQ ID NO:8. In a related embodiment the nucleic acid encodes 
an amino acid sequence of SEQ ID NO:8 comprising one or more conservative substitutions 
thereof. In a particular embodiment the DNA is recombinant (cDNA). 

All of the isolated nucleic acids and recombinant DNAs of the present invention can further 
15 comprise a heterologous nucleotide sequence. Such heterologous nucleotide sequences can 
encode, for example, a fusion peptide {e.g., a FLAG-tag) or a chimeric protein partner such 
as a fusion protein. 

The present invention also includes DNA constructs comprising the isolated DNAs encoding 
the /ENTPs of the present invention. In one such embodiment the DNA is operatively linked 
20 to an expression control sequence. In one embodiment the DNA is operatively linked to an 
expression control sequence and encodes the amino acid sequence of SEQ ID NO:2. In 
another embodiment the DNA is operatively linked to an expression control sequence and 
encodes the amino acid sequence of SEQ ID NO:2 comprising one or more conservative 
substitutions thereof. In a particular embodiment the DNA is a recombinant DNA (cDNA). 

25 Also included in the present invention are transfected or transduced cells which are 

transfected or transduced with the recombinant DNA constructs of the present invention. 
The transfected or transduced cells can be either a prokaryotic cell, or a eukaryotic cell. In 
one such embodiment, the transfected cell is a COS cell. In another embodiment, the 
transduced cell is a hematopoietic stem cell. In a particular embodiment, the transfected cell 

30 is a human T-cell leukemia CEM cell. In a preferred embodiment the transfected or 
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transduced cell is transfected or transduced with a DN A construct containing a DNA that is 
operatively linked to an expression control sequence and encodes the amino acid sequence of 
SEQ ID NO:2. In a related embodiment the transfected or transduced cell is transfected or 
transduced with a DNA construct containing a DNA that is operatively linked to an 
5 expression control sequence and encodes the amino acid sequence of SEQ ID NO:2 
comprising one or more conservative substitutions thereof. 

Another aspect of the present invention includes the isolated /ENTPs of the present invention 
and active fragments thereof. In its broadest embodiment the isolated /ENTP is a 
transmembrane protein that is NBMPR insensitive, and functions as an equilibrative 

10 nucleoside transport protein. In a particular embodiment, the /ENTP has approximately 450 
amino acids. In one embodiment the /'ENTP is encoded by a nucleotide sequence having at 
least 80% similarity with the coding sequence of SEQ ID NO: 1 . In another embodiment the 
/ENTP is encoded by a nucleotide sequence having at least 80% identity with the coding 
sequence of SEQ ID NO: I . In still another embodiment the /ENTP has an amino acid 

15 sequence of SEQ ID NO:2 comprising one or more conservative substitutions thereof. In a 
preferred embodiment the isolated /ENTP has an amino acid sequence of SEQ ID NO:2. 

In another embodiment the /ENTP is encoded by a nucleotide sequence having at least 80% 
similarity with the coding sequence of SEQ ID NO:7. In another embodiment the /ENTP is 
encoded by a nucleotide sequence having at least 80% identity with the coding sequence of 
20 SEQ ID NO:7. In still another embodiment the /ENTP has an amino acid sequence of SEQ 
ID NO:8 comprising one or more conservative substitutions thereof. In a preferred 
embodiment the isolated /ENTP has an amino acid sequence of SEQ IDNO:8. 

The present invention also includes modified /ENTPs of the present invention, such as tagged 
proteins, labeled proteins, fusion proteins and the like. Such modified /ENTPs may be used 
25 for example as antigens or for marker purposes. In a particular embodiment of this type, the 
fusion protein comprises an /ENTP protein or active fragment thereof having an amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:2 comprising a conservative substitution thereof. 
In preferred embodiments the modified /ENTP retains its activity as an NBMPR insensitive 
equilibrative nucleoside transport protein. 
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In a specific embodiment, an /ENTP fusion protein can be expressed. An /ENTP fusion 
protein comprises at least a functionally active portion of a noiw'ENTP protein joined via a 
peptide bond to at least a functionally active portion of an /ENTP polypeptide. In a particular 
embodiment, an /ENTP fusion protein or peptide contains an /ENTP or fragment thereof and 
5 a FLAG-tag. In an alternative embodiment, an /ENTP fusion protein or peptide contains an 
/ENTP or fragment thereof and green fluorescent protein or derivatives thereof, as 
exemplified in U.S. Patent 5,625,048 Issued 4/29/97 and International Publication No: WO 
97/26333, hereby incorporated by reference in their entireties, can also be used. 

The noiwENTP sequences of the /ENTP fusion protein can be amino- or carboxy-terminal to 
1 0 the /ENTP sequences. More preferably, for stable expression of an /ENTP fusion protein 
(including a proteolytically inactive /ENTP fusion protein), the portion of the noiwENTP 
fusion protein is joined via a peptide bond to the amino terminus of the /ENTP protein. A 
recombinant DNA molecule encoding such a fusion protein comprises a sequence encoding 
at least a functionally active portion of a noiwENTP protein joined in-frame to the /ENTP 
1 5 coding sequence. In one such embodiment the DNA molecule encodes a cleavage site for a 
specific protease, e.g., thrombin or Factor Xa, preferably at the /ENTP- noiwENTP juncture. 
In a specific embodiment, the fusion protein is expressed in Escherichia colL 

Antibodies to the /ENTPs of the present invention are also part of the present invention. In a 
20 particular embodiment the antibody is raised against an /ENTP having an amino acid 

sequence of SEQ ID NO:2. In another such embodiment the antibody is raised against an 
/ENTP having an amino acid sequence of SEQ ID NO:2 comprising one or more 
conservative substitutions thereof. In still another embodiment the antibody is raised against 
a portion of, or alternatively all of the N-terminal 92 amino acids of SEQ ID NO:2, i.e., 
25 amino acids 1-92 of SEQ ID NO:2. 

In one embodiment the antibody is a polyclonal antibody. In another embodiment the 
antibody is a monoclonal antibody. In yet another embodiment the monoclonal antibody is a 
chimeric antibody. The present invention also includes an immortal cell line that produces a 
monoclonal antibody of the present invention. 

30 Still another aspect of the present invention includes a transfected or transduced cell in which 
all detectable nucleoside transport activity is performed by the nucleoside transport protein 
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encoded by a nucleic acid of the present invention. In one embodiment of this type, the 
transfected or transduced cell is a vertebrate cell. In a preferred embodiment the transfected 
or transduced cell is a mammalian cell. In a more preferred embodiment the transfected or 
transduced cell is a human cell. In one such embodiment, the transfected cell is a human T- 

.5 cell leukemia CEM cell. In a more particular embodiment of this type the transfected human 
cell is a CEM/N 1-7 cell. In a preferred embodiment of this aspect of the present invention, 
all detectable nucleoside transport activity is performed by an /ENTP having the amino acid 
sequence of SEQ ID NO:2, or an active fragment of that /ENTP. In a related embodiment the 
/ENTP has the amino acid sequence of SEQ ID NO:2 comprising a conservative substitution 

1 0 thereof, or an active fragment of that /ENTP. 



The present invention also includes a nucleoside transport deficient subline of a human T-cell 
leukemia cell line CEM, transfected with an Epstein-Barr Nuclear Antigen 1 expression 
cassette, in which the cell line is capable of supporting the episomal replication of an 
Epstein-Barr virus-based mammalian expression vector. In one particular embodiment of 
1 5 this type the expression vector is pDR2. In a preferred embodiment of this type the cell line 
has a stable transfection frequency with pDR2 of approximately 10 2 . In one particular 
embodiment the nucleoside transport deficient subline is CEM/C19. 

Ribozymes specifically designed to modify the nucleic acids of the present invention are also 
contemplated as part of the present invention. Similarly antisense nucleic acids that 
20 hybridize under physiological conditions to an mRNA encoding an /ENTP of the present 
invention is also included in the present invention. In one such embodiment, the antisense 
nucleic acid hybridizes to the mRNA that corresponds to the sense strand of nucleotides 238- 
1 605 of the nucleotide sequence of SEQ ID NO: 1 . 

A related aspect of the invention is a knockout mouse for the iENTPs of the present 
25 invention. One such embodiment comprises a first and a second allele which naturally 

encode and express the nucleoside transport protein having the amino acid sequence of SEQ 
ID NO:2. Both the first allele and the second allele each contain a defect which prevents the 
knockout mouse from expressing a nucleoside transport protein that is both insensitive to 
NBMPR and can function as an equilibrative nucleoside transport protein. Such a knockout 
30 mouse is particularly susceptible to drugs such as NBMPR. 
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The present invention also includes methods of making and using the i EN TPs, antibodies to 
the /ENTPs, the nucleic acids encoding the i ENTPs, oligonucleotides that hybridize to these 
nucleic acids, DNA constructs containing these nucleic acids, cells containing these 
constructs, as well as to the other compositions and processes of the present invention. 

5 Accordingly, one aspect of the present invention includes a method of isolating a cDNA 
encoding a nucleoside transport protein. This process comprises transfecting a nucleoside 
transport protein deficient cell with an expression vector from an expression vector library, 
wherein the expression vector library contains a vector comprising a cDNA encoding a 
nucleoside transport protein. The cDNA encoding the nucleoside transport protein is 

1 0 expressed in the transfected cell. An expression vector containing the cDN A encoding a 
nucleoside transport protein is selected by culturing the transfected cell under conditions in 
which the cell growth is dependent on the expression of the nucleoside transport protein. 
Therefore the selected expression vector contains the cDNA encoding a nucleoside transport 
protein. The selected expression vector is extracted from the transfected cell. A host cell is 

15 transfected with the selected expression vector, and the cDNA encoding the nucleoside 
transport protein is isolated. 



In a specific embodiment of this type includes a method of isolating a cDNA encoding an 
NBMPR insensitive, equilibrative nucleoside transport protein (iENTP cDNA). This process 
comprises transfecting a nucleoside transport protein deficient cell with an expression vector 

20 from an expression vector library, wherein a cDNA library containing an iENTP cDNA has 
been subcloned into the expression vector library, and wherein the iENTP cDNA is expressed 
in the transfected cell. An expression vector containing the iENTP cDNA is selected by 
culturing the transfected cell under conditions in which the ceil growth is dependent on the 
expression of the /ENTP and its corresponding transport activity, and wherein the selected 

25 expression vector contains the iENTP cDNA. The selected expression vector is extracted 
from the transfected cell. A host cell is transfected with the selected expression vector, and 
the cDNA encoding the NBMPR insensitive, equilibrative nucleoside transport protein is 
isolated. In a preferred embodiment of this type the transfected cell is a human cell that 
expresses EBNA-1 and the human cell is CEM/C19. 

30 Another aspect of the present invention includes a method of making an NBMPR insensitive, 
equilibrative nucleoside transport protein of the present invention through introducing an 
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expression vector comprising a nucleic acid encoding the /ENTP or an active fragment 
thereof into a host cell, and expressing the nucleic acid in the host cell. In one embodiment 
the host cell is a prokaryotic cell. In another embodiment the host cell is a eukaryotic cell. 
In one specific embodiment, the eukaryotic cell is an insect cell. In a particular embodiment 
5 the /ENTP has an amino acid sequence of SEQ ID NO:2. In another particular embodiment 
the iENTP has an amine acid sequence of SEQ ID NO:2 comprising a conservative 
substitution thereof. In one embodiment the method further comprises purifying the 
/ENTP. 



The present invention includes methods for obtaining a purified NBMPR insensitive, 
1 0 equilibrative nucleoside transport protein (/ENTP) or an active fragment thereof, from a cell 
that expresses the /ENTP which comprises lysing the cell, and purifying the NBMPR 
insensitive, equilibrative nucleoside transport protein. In one embodiment the purifying step 
includes extracting the /ENTP from the plasma membrane of the cell. In another such 
embodiment the purifying step also includes fractionating the proteins contained in the ceil. 
15 In a particular embodiment, the /ENTP is obtained from a natural source. In a preferred 
embodiment the natural source is a mammalian cell. In another particular embodiment the 
/ENTP is a recombinant protein obtained from a prokaryotic cell. In still another 
embodiment the /ENTP is a recombinant protein obtained from a eukaryotic cell. In one 
preferred embodiment the /ENTP has an amino acid sequence of SEQ ID NO:2. In another 
20 preferred embodiment the /ENTP has an amino acid sequence of SEQ ID NO:2 comprising a 
conservative substitution thereof. 



Yet another aspect of the invention includes a method of identifying a ligand of an /ENTP of 
the present invention which comprises contacting a potential ligand with the isolated /ENTP 
under physiological conditions (e.g., neutral pH, buffered solution with approximately 150 

25 mM salt) and detecting whether the potential iigand binds to the /ENTP wherein a potential 
ligand is selected as a ligand if it binds to the /ENTP. The ligand and/or the /ENTP can be 
labeled such as with a label defined below. Similarly, either the /ENTP or ligand can be 
attached to a solid support. The binding can be detected with any of the standard protein- 
ligand binding assays known in the art as exemplified below. Once a ligand is identified its 

30 dissociation constant can be determined. Alternatively, the detecting step may be performed 
by determining the dissociation constant initially. In either case a potential ligand is selected 
as a ligand when the dissociation constant is less than 10" 5 M. In one such embodiment the 
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ligand is a permeant of the /ENTP. In another embodiment, the ligand is an inhibitor of the 
/ENTP. In yet another embodiment, the ligand is both a permeant and an inhibitor of the 
/ENTP. In one preferred embodiment the /ENTP has an amino acid sequence of SEQ TD 
NO:2. In another preferred embodiment the /ENTP has an amino acid sequence of SEQ ID 
5 NO:2 comprising a conservative substitution thereof. 

The present invention also includes specific methods of identifying a permeant of an 
NBMPR insensitive, equilibrative nucleoside transport protein (/ENTP). In one such 
embodiment a nucleoside or nucleoside analog is contacted with a transfected or transduced 
cell of the present invention in which all detectable nucleoside transport activity is performed 

10 by an /ENTP of the present invention. The nucleoside transport of the nucleoside or 

nucleoside analog by the transfected or transduced cell is evaluated, wherein the nucleoside 
or nucleoside analog is identified as a permeant when the transport of the nucleoside or 
nucleoside analog into the transfected or transduced cell is determined to follow a facilitated 
diffusion process. In one such embodiment the nucleoside or nucleoside analog is an 

1 5 antiviral nucleoside analog. In another embodiment the nucleoside or nucleoside analog is 
an antitumor nucleoside analog. In one particular embodiment of this type the transfected or 
transduced cell is a transfected or transduced human cell. In one preferred embodiment the 
/ENTP has an amino acid sequence of SEQ ID NO:2. In another preferred embodiment the 
/ENTP has an amino acid sequence of SEQ ID NO:2 comprising a conservative substitution 

20 thereof. 

The present invention further includes specific methods of selecting drugs that inhibit an 
NBMPR insensitive, equilibrative nucleoside transport protein. One such embodiment 
comprises contacting a potential drug with a transfected or transduced cell of the present 
invention in which all delectable nucleoside transport activity is performed by an /ENTP of 
25 the present invention. The nucleoside transport activity of the cell is evaluated. A potential 
drug is selected as a drug when a decrease in the nucleoside transport activity is determined 
relative to that determined when the evaluating was performed in the absence of the potential 
drug. 

In one embodiment of this type the nucleoside transport activity of the transfected or 
30 transduced cell is evaluated as a function of the determination of the trans-stimulation of a 
permeant. In another embodiment the nucleoside transport activity of the transfected or 
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transduce cell is evaluated as a function of the determination of the direct transport of a 
permeant. In still another embodiment the nucleoside transport activity of the transfected or 
transduced cell is evaluated as a function of the determination of the countertran sport of a 
permeant. In one specific embodiment, the nucleoside transport activity of the transfected or 
5 transduced cell is evaluated as a function of the toxicity of a nucleoside analog which is a 
permeant of the /ENTP, such as tubercidin, 2-chloro-2'-deoxyadenosine > or Ara-C. In yet 
another embodiment, the nucleoside transport activity of the transfected or transduced cell is 
evaluated as a function of toxicity in the presence of an antimetabolite. In yet another 
embodiment the nucleoside transport activity of the transfected or transduced cell is 
10 evaluated as a function of two of these determinations. In still another embodiment the 

nucleoside transport activity of the transfected or transduced cell is evaluated as a function of 
all of these determinations. 

Another embodiment of a method of selecting a drug that inhibits an NBMPR insensitive, 
equilibrative nucleoside transport protein (/ENTP) comprises detecting the mutual inhibition 
15 (i.e. mutual competition) of a potential drug with a permeant, such as uridine for the /ENTP 
in a transfected or transduced cell of the present invention in which all detectable nucleoside 
transport activity is performed by an /ENTP of the present invention. A potential drug is 
selected as a drug when mutual inhibition is detected. This embodiment may be used alone 
or in conjunction with the other determinations described above. 

20 For any of the drug assays of the present invention the /ENTP functions as an equilibrative 
nucleoside transport protein, is insensitive to NBMPR, and contains approximately 450 
amino acid residues. In one particular embodiment the /ENTP has an amino acid sequence of 
SEQ ID NO:2. In another embodiment the /ENTP has an amino acid sequence of SEQ ID 
NO:2 comprising a conservative substitution thereof. In a preferred embodiment the 

25 transfected or transduced cell is a human transfected or transduced cell. 

Accordingly, it is a principal object of the present invention to provide a purified NBMPR- 
insensitive, equilibrative nucleoside transport protein (/ENTP). 



It is a further object of the present invention to provide an isolated nucleic acid encoding a 
/ENTP. 
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It is a further object of the present invention to provide a DNA construct containing a nucleic 
acid encoding a iENTP. 

it is a further object of the present invention to provide an antibody specific for a purified 
/ENTP. 

5 It is a further object of the present invention to provide a method of producing an /ENTP, 
including through modification of a /ENTP, and through recombinant technology. 

It is a further object of the present invention to provide a method of selecting a drug that 
preferentially inhibits an /ENTP-dependent nucleoside transport pathway. 

It is a further object of the present invention to provide a method of screening drug libraries 
1 0 for drugs that preferentially inhibit an /ENTP. 

It is a further object of the present invention to provide a cell in which the only detectable 
facilitated diffusion pathway for nucleosides includes an /ENTP. 

It is a further object of the present invention to provide a cell where the NBMPR-insensitive 
1 5 facilitated diffusion pathway for nucleosides includes a modified /ENTP. 

It is a further object of the present invention to provide a method of cancer chemotherapy by 
transducing hematopoietic stem cells ex vivo with a cDNA encoding an /ENTP. introducing 
the transduced cells into an animal subject, and then treating the animal subject with an 
antimetabolite and NBMPR. 

20 It is a further object of the present invention to provide a novel method of hematopoietic cell- 
directed gene therapy using an expression vector encoding the /ENTP. 

These and other aspects of the present invention will be better appreciated by reference to the 
following drawings and Detailed Description. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows uridine uptake in N 1-7 and CI 9 cells. Figure 1 A depicts 3 H-uridtne uptake 
(10 uM at 2.5 uCi/ml) by CEM/N1-7 (•, ♦) and CEM/C19 ( A , T ) cells which was 

determined at 22°C in Na + (filled symbols) or NaMree (□) in the presence (♦, ▼) or absence 

5 (•» Q> A ) of a large excess (4 mM) unlabeled uridine as described in the methods. The 

values shown are means of duplicate assays and have been corrected for extracellular water 
space. Figures I B-1C depict the uptake of 10 u.M 3 H-uridine determined at 22 °C in Na + 
buffer using a 7.5 second uptake interval in the presence of nucleoside transport inhibitors. 
Hie uptake interval was initiated by simultaneous addition of the label and inhibitory 

10 nucleobase, nucleoside or nucleotide. In the case of NBMPR and dipyridamole, the cells 
were incubated for 5 minutes at 22°C in the appropriate concentration of inhibitor prior to 
starting the assay. Uridine uptake was also determined in the presence of a large excess (4 
mM) of unlabeled uridine to determine the radioactivity associated with the extracellular 
water space and simple diffusion. This value was subtracted from the total uridine uptake 

15 values. The results are from triplicate assays and are expressed as percent of uridine uptake 
in the absence of inhibitor. 

Figures 2A and 2B depict the comparison of the amino acid sequences of proteins related to 
hENT2. Two related sequences found in the Genbank database were aligned with hENT2 
using the Pileup program in the GCG suite of sequence analysis software. Residues that are 
20 identical or have conservative substitutions in at least 8 of the 10 sequences are shown on a 
black background, and conservative substitutions in at least 5 of the 1 0 sequences are shown 
on a grey background. Putative transmembrane domains are shown by dashed lines (===), 
theN-glycosylation consensus sequence ****, and the start site for mouse and human 
HNP36 by #. 

25 Figure 3 shows uridine uptake by COS-1 cells transiently transfected with pcDNA3/Nl-71 
constructs. COS-1 cells were transfected with pcDNA3/N1-71orfl (■) or 
pcDNA3/Nl-71orf2 (o) as described in the methods. Control cells (•) were transfected with 
the pcDNA3 plasmid without an insert. Uptake of 10 \iM 3 H-uridine (2 uCi/ml) was 
determined in sodium-free buffer 72 hours after transfection. 0.1 uM NBMPR was present 
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in all assays to block the endogenous es transporter in COS- 1 cells {4539}. The values 
shown are the average of triplicate determination. 

Figure 4 demonstrates the expression of hENT2 in human cell lines and tissues. Northern 
blots of polyA+ RNA from the indicated cell lines (Figure 4A) and tissues (Figure 4B) were 
5 hybridized with a BamHI/Nhel fragment of hENT2 ( 1 .8kb, nucleotides 393-2 1 83 of SEQ ID 
NO:l) and washed at high stringency as described in the methods. 

Figure 5 is a schematic drawing showing the results of challenging CD34 positive 
hematopoietic stem cells with an antimetabolite and a nucleoside transport inhibitor, after the 
cells have been transduced, ex vivo, with a viral vector comprising a nucleic acid encoding 
hENT2. The antimetabolite (e.g., methotrexate, trimetrexate, 5-FU or PALA) plus the 
nucleoside transport inhibitor NBMPR or draflazine are administered to the cells thereby 
selectively enriching for cells that have been successfully transduced with hENT2. The 
enrichment is achieved because the antimetabolite prevents de novo synthesis of the 
nucleosides required for cell growth, and the transduced cells are uniquely resistant to 
NBMPR and draflazine and thereby retaining a functional salvage pathway for purine and 
pyrimidine nucleosides present in exogenous nucleoside pools. Although not shown, the ex 
vivo transduced cells may be transplanted into the animal subject and the antimetabolite and 
nucleoside transport inhibitor may be administered parenterally. 

Figure 6 shows the exon structure of an /ENTP of the present invention. As shown, each 
20 exon approximately defines a different functional domain. TM is short for transmembrane. 

Figure 7 shows the 3.5 KB message isolated from Thymus which is apparently a splice 
variant of an /ENTP. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention in its broadest embodiment provides an equilibrative nucleoside 
25 transport protein which is insensitive to nitrobenzylmercaptopurineriboside, NBMPR, [6-[(4- 
nitrobenzyl)thio-9-p-D-ribofuranosyl purine]. The NBMPR-insensitive nucleoside transport 
protein (/ENTP) is a transmembrane protein that serves to transport nucleosides across the 
plasma membrane through a facilitated diffusion process. The present invention also 



10 



15 
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provides nucleic acids encoding the /ENTPs of the present invention which can be used to 
transfect or transduce mammalian cells for various medical purposes. For example, such a 
transfected or transduced cell can be used as a screening tool for identifying antitumor and 
antiviral nucleoside analogs that can be preferentially transported into cells by this specific 
5 nucleoside transport protein. In addition, hematopoietic cells transduced with an /ENTP of 
the present invention can be used in cancer chemotherapy protocols in which both the de 
novo nucleoside biosynthesis and the major nucleoside salvage pathway of the hematopoietic 
cells are purposely disabled. In such instances the expression of the /ENTP uniquely 
provides the transduced cells with an alternative means of obtaining the required nucleosides. 

10 

Therefore, if appearing herein, the following terms shall have the definitions set out 
below. 

As used herein "/ENTP" and n ei" transporter are interchangeable names for an 
NBMPR-insensitive equilibrative nucleoside transport protein, which is a transmembrane 
15 protein that functions in the facilitated diffusion of nucleosides across cell membranes. As 
disclosed herein, "hENT2" is a human /ENTP that has an amino acid sequence of SEQ ID 
NO:2. The natural nucleic acid sequence encoding hENT2 consists of nucleotides 238-1605 
of SEQ ID NO: I. 

A nucleoside transporter may be classified as being either "NBMPR-insensitive" or 
20 "NBMPR-sensitive." This classification system is widely accepted in the field [See Griffith 
et al., Biochim. Bioph. Acta 1286: 153-181 (1996)] to distinguish two major classes of 
nucleoside transport proteins. "NBMPR-sensitive" indicates that the nucleoside transporter 
has a high sensitivity to the inhibitor NBMPR i.e., 1C 50 values of 0.1 to 1 nanomolar. 
"NBMPR-insensitive" indicates that the nucleoside transporter is insensitive to nanomolar 
25 concentrations of NBMPR, but can be inhibited by higher (e.g., micromolar) concentrations. 

As used herein an "active fragment" of an /ENTP is a polypeptide or glycopolypcptide that 
has an amino acid sequence that corresponds to that of a full-length /ENTP except the active 
fragment has at least one less amino acid than the corresponding full-length /ENTP; further 
an "active fragment" of an /ENTP is NBMPR-insensitive, and has at least 20% of the 
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nucleoside transport activity of the corresponding full-length /ENTP, (determined under 
conditions in which the full-length /ENTP has nucleoside transport activity.) 

As used herein a "functional /ENTP" is a /ENTP that is NBMPR-insensitive and has at least 
20% of the nucleoside transport activity of the corresponding native /ENTP. 

5 As used herein, an "antimetabolite" is a compound that interferes with the synthesis and/or 
metabolism of nucleotides or nucleosides. In one instance an antimetabolite can inhibit de 
novo nucleotide synthesis. In another instance an antimetabolite can be a nucleoside analog 
that interferes with a nucleoside and/or nucleotide-dependent process. Antimetabolites 
include trimetrexate, methotrexate (MTX), N-(phosphonacetyl)-L-aspartic acid (PALA), and 
10 5-fluorouracil (5-FU). 

As used herein "facilitated diffusion" is a carrier- mediated transport system that operates 
along a concentration gradient of the permeating solute. At equilibrium, the solute will attain 
the same concentration on either side of the membrane, as in simple diffusion. The transport 
of a permeant "is determined to follow a facilitated diffusion process" when the kinetic 
1 5 determinations for the transport process are consistent with that predicted for a facilitated 
diffusion process [See Stein, Transport and Diffusion Across Cell Membranes, Academic 
Press, London (1986)]. 

As used herein the term "approximately" is used to signify that a value is within ten percent 
20 of the indicated value i.e., a protein containing "approximately" 450 amino acid residues can 
contain between 405 and 495 amino acid residues. 

As used herein the term "binds to" is meant to include all such specific interactions that result 
in two or more molecules showing a preference for one another relative to some third 
molecule. This includes processes such as covalent, ionic, hydrophobic and hydrogen 
25 bonding but does not include non-specific associations such solvent preferences. 

As used herein a "ligand" of an /ENTP can be either a natural or artificial binding partner for 
the /ENTP which binds to the /"ENTP under physiological conditions forming a binding 
complex. In preferred embodiments the /ENTP-ligand binding complex has a dissociation 
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constant of less than I0 5 M. Ligands include permeants, and/or inhibitors and activators of 
the j'ENTP-dependent nucleoside salvage pathway. 

As used herein a "permeant" is a nucleoside or nucleoside analog that binds to an z'ENTP and 
is transported by the /ENTP across a membrane by a facilitated diffusion process. 

5 As used herein the "/ENTP-dependent nucleoside salvage pathway" is used to denote an 
/ENTP-dependent transport of nucleosides across the plasma membrane of a cell. 

As used herein a cell has "detectable nucleoside transport activity" when the rate of 
nucleoside uptake of the cell is greater than that determined for simple diffusion. The rate of 
diffusion is measured by the uptake of 3 H-uridine (1-10 /^M) over a thirty second time 

10 course, at 22 °C in the presence of a large excess (1-1 OmM) of unlabeled uridine, or a 
competing nucleoside. For example, see Figure 1, where the uptake of 5 H-uridine by 
CEM/C19 cells is not significantly different in the presence or absence of a large excess of 
unlabeled uridine. Similarly, a cell has "no detectable nucleoside transport activity" when 
the rate of nucleoside uptake of the cell is not significantly different than that determined for 

1 5 a simple diffusion process, as determined by the method above. 

A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA segment 
may be attached so as to bring about the replication of the attached segment. A ,, replicon ,, is 
any genetic element (e.g., plasmid, chromosome, virus) that functions as an autonomous unit 
of DNA replication in vivo, i.e., capable of replication under its own control. 

20 A "cassette" refers to a segment of DNA that can be inserted into a vector at specific 

restriction sites. The segment of DNA encodes a polypeptide of interest, and the cassette and 
restriction sites are designed to ensure insertion of the cassette in the proper reading frame 
for transcription and translation. 

A cell has been "transfected" by exogenous or heterologous DNA when such DNA has been 
25 introduced inside the cell. The transfecting DNA can be maintained in an episome as 

exemplified in the transfection studies using the CEM/N1-7 cell described herein. A cell has 
been "transduced" by exogenous or heterologous DNA when the exogenous or heterologous 
DNA is introduced by a viral vector. 
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A "heterologous nucleotide sequence" as used herein is a nucleotide sequence that is added to 
a nucleotide sequence of the present invention by recombinant methods to form a nucleic 
acid which is not naturally formed in nature. Such nucleic acids can encode chimeric and/or 
fusion proteins. Thus the heterologous nucleotide sequence can encode peptides and/or 
5 proteins which contain regulatory and/or structural properties. In another such embodiment 
the heterologous nucleotide can encode a protein or peptide that functions as a means of 
detecting the protein or peptide encoded by the nucleotide sequence of the present invention 
after the recombinant nucleic acid is expressed. In still another embodiment the heterologous 
nucleotide can function as a means of detecting a nucleotide sequence of the present 
10 invention. A heterologous nucleotide sequence can comprise non-coding sequences 
including restriction sites, regulatory sites, promoters and the like. 



A "heterologous" region of the DNA construct is an identifiable segment of DNA within a 
larger DNA molecule that is not found in association with the larger molecule in nature. 
Thus, when the heterologous region encodes a mammalian gene, the gene will usually be 

15 flanked by DNA that does not flank the mammalian genomic DNA in the genome of the 
source organism. Another example of a heterologous coding sequence is a construct where 
the coding sequence itself is not found in nature (e.g., a cDNA where the genomic coding 
sequence contains introns, or synthetic sequences having codons different than the negative 
gene). Allelic variations or natural ly-occurring mutational events do not give rise to a 

20 heterologous region of DNA as defined herein. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of ribonucleosides 
(adenosine, guanosine, uridine or cytidine; "RNA molecules") or deoxyri bo nucleosides 
(deoxyadenosine, deoxyguanosine, deoxythymidinc, or deoxycytidine; "DNA molecules"), or 
any phosphoester analogs thereof, such as phosphorothioates and thioestcrs, in either single 

25 stranded form, or a double-stranded helix. Double stranded DNA-DNA, DNA-RNA and 
RNA-RNA helices are possible. The term nucleic acid molecule, and in particular DNA or 
RNA molecule, refers only to the primary and secondary structure of the molecule, and does 
not limit it to any particular tertiary forms. Thus, this term includes double-stranded DNA 
found, inter alia, in linear or circular DNA molecules (e.g., restriction fragments), plasrnids, 

30 and chromosomes. In discussing the structure of particular double-stranded DNA molecules, 
sequences may be described herein according to the normal convention of giving only the 
sequence in the 5' to 3' direction along the nontranscribed strand of DNA (i.e., the strand 
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having a sequence homologous to the mRNA). A "recombinant DNA molecule" is a DNA 
molecule that has undergone a molecular biological manipulation. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such as a cDNA, 
genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule can anneal 
5 to the other nucleic acid molecule under the appropriate conditions of temperature and 

solution ionic strength (see Sambrook et aL, supra). The conditions of temperature and ionic 
strength determine the "stringency" of the hybridization. For preliminary screening for 
homologous nucleic acids, low stringency hybridization conditions, corresponding to a T m of 
55°, can be used, e.g., 5x SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30% 

10 formamide, 5x SSC, 0.5% SDS). Moderate stringency hybridization conditions correspond 
to a higher T ro , e.g., 40% formamide, with 5x or 6x SCC. High stringency hybridization 
conditions correspond to the highest T )n , e.g.. 50% formamide, 5x or 6x SCC. Hybridization 
requires that the two nucleic acids contain complementary sequences, although depending on 
the stringency of the hybridization, mismatches between bases are possible. The appropriate 

1 5 stringency for hybridizing nucleic acids depends on the length of the nucleic acids and the 
degree of complementation, variables well known in the art. The greater the degree of 
similarity or homology between two nucleotide sequences, the greater the value of T m for 
hybrids of nucleic acids having those sequences. The relative stability (corresponding to 
higher T m ) of nucleic acid hybridizations decreases in the following order: RNAiRNA, 

20 DNA:RNA, DNA:DNA. For hybrids of greater than 100 nucleotides in length, equations for 
calculating T m have been derived (see Sambrook et aL, supra, 9.50-0.51 ). For hybridization 
with shorter nucleic acids, i.e., oligonucleotides, the position of mismatches becomes more 
important, and the length of the oligonucleotide determines its specificity (see Sambrook et 
al. y supra, 1 1.7-1 1 .8). Preferably a minimum length for a hybridizable nucleic acid is at least 

25 about 12 nucleotides; preferably at least about 18 nucleotides; and more preferably the length 
is at least about 27 nucleotides; and most preferably 36 nucleotides. 

In a specific embodiment, the term "standard hybridization conditions" refers to a T m of 

55 °C, and utilizes conditions as set forth above. In a preferred embodiment, the T m is 60°C; 

in a more preferred embodiment, the T m is 65 °C. 

30 "Homologous recombination" refers to the insertion of a foreign DNA sequence of a vector 
in a chromosome. Preferably, the vector targets a specific chromosomal site for homologous 
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recombination. For specific homologous recombination, the vector will contain sufficiently 
long regions of homology to sequences of the chromosome to allow complementary binding 
and incorporation of the vector into the chromosome. Longer regions of homology, and 
greater degrees of sequence similarity, may increase the efficiency of homologous 
5 recombination. 



A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed and 
translated into a polypeptide in a cell in vitro or in vivo when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are determined by 
a start codon at the 5' (amino) terminus and a translation stop codon at the 3' (carboxyl) 
10 terminus. A coding sequence can include, but is not limited to, prokaryotic sequences, 

cDNA from eukaryotic mRNA, genomic DNA sequences from eukaryotic {e.g., mammalian) 
DMA, and even synthetic DNA sequences. If the coding sequence is intended for expression 
in a eukaryotic cell, a polyadenylation signal and transcription termination sequence will 
usually be located 3' to the coding sequence. 

1 5 "Transcriptional and translational control sequences" are DNA regulatory sequences, such as 
promoters, enhancers, terminators, and the like, that provide for the expression of a coding 
sequence in a host cell. In eukaryotic cells, polyadenylation signals are control sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase in a 
cell and initiating transcription of a downstream (3' direction) coding sequence. For 

20 purposes of defining the present invention, the promoter sequence is bounded at its 3' 

terminus by the transcription initiation site and extends upstream (5' direction) to include the 
minimum number of bases or elements necessary to initiate transcription at levels detectable 
above background. Within the promoter sequence will be found a transcription initiation site 
(conveniently defined for example, by mapping with nuclease SI), as well as protein binding 

25 domains (consensus sequences) responsible for the binding of RNA polymerase. 

A coding sequence is "under the control" of transcriptional and translational control 
sequences in a cell when RNA polymerase transcribes the coding sequence into mRNA, 
which is then trans-RNA spliced and translated into the protein encoded by the coding 
sequence. 
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A "signal sequence" is included at the beginning of the coding sequence of a protein to be 
expressed on the surface of a cell. This sequence encodes a signal peptide, N-terminal to the 
mature polypeptide, that directs the host cell to translocate the polypeptide. The term 
"translocation signal sequence" is used herein to refer to this sort of signal sequence. 
5 Translocation signal sequences can be found associated with a variety of proteins native to 
eukaryotes and prokaryotes, and are often functional in both types of organisms. 

As used herein, the term "sequence homology" in all its grammatical forms refers to the 
relationship between proteins that possess a "common evolutionary origin," including 
proteins from superfamilies (e.g., the immunoglobulin superfamily) and homologous proteins 
10 from different species (e.g., myosin light chain, etc.) (Reeck et al, 1987, Cell 50:667). 

Accordingly, the term "sequence similarity" in all its grammatical forms refers to the degree 
of identity or correspondence between nucleic acid or amino acid sequences of proteins that 
do not share a common evolutionary origin (see Reeck et aL, supra). However, in common 
usage and in the instant application, the term "homologous," when modified with an adverb 
1 5 such as "highly," may refer to sequence similarity and not a common evolutionary origin. 

In a specific embodiment, two DNA sequences are "substantially homologous" or 
"substantially similar" when at least about 50% (preferably at least about 75%, and most 
preferably at least about 90 or 95%) of the nucleotides match over the defined length of the 
DNA sequences. Sequences that are substantially homologous can be identified by 
20 comparing the sequences using standard software available in sequence data banks, or in a 
Southern hybridization experiment under, for example, stringent conditions as defined for 
that particular system. Defining appropriate hybridization conditions is within the skill of the 
art. See, e.g., Maniatis et aL, supra\ DNA Cloning, Vols. I & II, supra: Nucleic Acid 
Hybridization, supra. 

25 Similarly, in a particular embodiment, two amino acid sequences are "substantially 
homologous" or "substantially similar" when greater than 30% of the amino acids are 
identical, or greater than about 60% are similar (functionally identical). Preferably, the 
similar or homologous sequences are identified by alignment using, for example, the GCG 
(Genetics Computer Group, Program Manual for the GCG Package, Version 7, Madison, 

30 Wisconsin) pileup program. 
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The term "corresponding to" is used herein to refer similar or homologous sequences, - 
whether the exact position is identical or different from the molecule to which the similarity 
or homology is measured. Thus, the term "corresponding to" refers to the sequence 
similarity, and not the numbering of the amino acid residues or nucleotide bases. 



5 /'ENTPs: Proteins and Polypeptides 

The present invention provides isolated /ENTPs and active fragments thereof. An /ENTP is a 
transmembrane protein that contains approximately 450 amino acid residues, 8 to 12 putative 
transmembrane domains, is NBMPR insensitive, and functions as an equilibrative nucleoside 
transport protein. /ENTPs represent one class of five, or more, classes of nucleoside 

10 transporters found in mammalian cells (Griffith et al, 1996, supra: Cassrl995, supra). In 
preferred embodiments the iENTP is a mammalian protein. In one embodiment the /ENTP is 
encoded by a nucleotide sequence having at least 80% similarity with the coding sequence of 
SEQ ID NO: 1 . In another embodiment the /'ENTP is encoded by a nucleotide sequence 
having at least 80% identity with the coding sequence of SEQ ID NO: 1 . In still another 

1 5 embodiment the /ENTP has an amino acid sequence of SEQ ID NO:2 comprising one or 

more conservative substitutions thereof. In a preferred embodiment the isolated /ENTP is the 
human homoiog (hENT2) having an amino acid sequence of SEQ ID NO:2. The /ENTPS of 
the present invention may be used in assays to identify novel drugs, and the like, and in 
protein structure and mechanistic studies. 

20 The hENT2 protein is 50% identical (having 69% similarity) to the hENTl protein, the 

human homologue of the NBMPR-sensitive equilibrative nucleoside transport protein. As 
found for the concentrative transporters, hENTl and hENT2 do not share significant 
homology with other known membrane transport proteins, and appear to represent a new 
family of transport proteins. 

25 Surprisingly, the carboxy- terminal portion of the hENT2 protein is nearly identical to a 326 
residue predicted peptide (hHNP36) in the Genbank database that has been identified as 
growth factor-induced "delayed early response" gene of unknown function [Williams et al 
Biochem.Biophy.Res.Comrrt 213:325-333 (1995)]. Inspection of the hHNP36 nucleotide 
sequence revealed two potential open reading frames with hHNP36 translated from the 
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second start codon. While hENT2 also has two potential start codons, they are withinthe 
same open reading frame. The full length cDNA of hHNP36 (2281 bp) is nearly identical to 
hENT2, but contains a 68 bp deletion beginning at position 338. This deletion shifts the 
initial reading frame relative to hENT2 and would result in a truncated 22 Kd peptide with 
5 only 5 1 % identity to the hE>JT2 protein. Transient transfection studies with full length 
hENT2 and a 5'-truncated construct that lacks the first start codon (predicted protein 99% 
identical to hHNP36) demonstrated that a functional nucleoside transport protein is not 
produced from the second start codon. These data indicate that the hHNP36 peptide appears 
to be a truncated, non-functional form of hENT2. 

10 The high degree of homology between hENT2 and hHNP36 was completely unexpected, and 
could not have been predicted from the earlier work of Williams et al [Williams ei al, 1995, 
supra], HNP36 had been identified as a 36 Kd peptide by in vitro translation of both the 
human and mouse mRNA homologs, but no data was provided regarding its function. It was 
only reported that HNP36 was localized in the nucleolus, as determined by immunostaining 

15 studies in mouse cells [Williams et aL, 1995, supra]. Therefore, knowledge of the 36 Kd 

peptide in the absence of the teachings provided by the present invention, could not have led 
to the /EN TPs of the present invention. 

The possible identity of hENT2 with a full-length, in frame natural analog of hHNP36 is 
consistent with the i'ENTP being a "delayed early response" gene. When quiescent cells are 
20 stimulated to proliferate there is a sequential expression of cellular genes whose products are 
thought to mediate the long-term responses to the growth factors. Therefore /ENTPs are 
likely to play an important role in the proliferative response, when quiescent cells are 
stimulated into cycle by growth factors. 

Modified iENTPs: The present invention also provides active fragments of the /ENTPs and 
25 modified /ENTPs of the present invention, such as tagged proteins, labeled proteins, fusion 
proteins and the like. Such modified /ENTPs may be used for example as antigens or for 
marker purposes. In a particular embodiment of this type, the fusion protein comprises an 
/ENTP protein or active fragment thereof having an amino acid sequence of SEQ ID NO:2 or 
SEQ ID NO:2 comprising a conservative substitution thereof Modified /ENTPs of the 
30 present invention retain their activity as NBMPR insensitive equilibrative nucleoside 
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transport proteins. One particular use of the /ENTP fusion proteins of the present invention 
is for the production of the /ENTP-antibodies of the present invention. 

An /ENTP fusion protein comprises at least a portion of a non-/ENTP protein joined via a 
peptide bond to at least a portion of an /ENTP polypeptide. In preferred embodiments the 
5 portion of die /ENTP is functional. The noiw'ENTP sequences can be amino- or earboxy- 
terminal to the /ENTP sequences. More preferably, for stable expression of a proteolytically 
inactive /ENTP fusion protein, the portion of the non-ZENTP fusion protein is joined via a 
peptide bond to the amino terminus of the /ENTP protein. A recombinant DNA molecule 
encoding such a fusion protein comprises a sequence encoding at least a portion of a non- 

10 /ENTP protein joined in-frame to the /ENTP coding sequence, and preferably encodes a 
cleavage site for a specific protease, e.g., thrombin or Factor Xa, preferably at the /ENTP- 
, non-ZENTP juncture. In a specific embodiment the fusion protein is expressed in 
Escherichia coli. Such a fusion protein can be used to isolate the /ENTPs of the present 
invention, through the use of an affinity column which is specific for the protein fused to the 

1 5 /ENTP. The purified iENTP may then be released from the fusion protein through the use of 
a proteolytic enzyme and the cleavage site such as has been referred to above. 

In one such embodiment a chimeric /ENTP can be prepared, e.g., a glutathione-S-transferase 
(GST) fusion protein, a maltose-binding (MBP) protein fusion protein, or a poly-histidine- 
tagged fusion protein, for expression in a eukaryotic cell. Expression of an /ENTP as a 

20 fusion protein can facilitate stable expression, or allow for purification based on the 

properties of the fusion partner. For example, GST binds glutathione conjugated to a solid 
support matrix, MBP binds to a maltose matrix, and poly-histidine chelates to a Ni-chelation 
support matrix. The fusion protein can be eluted from the specific matrix with appropriate 
buffers, or by treating with a protease specific for a cleavage site usually engineered between 

25 the /ENTP and the fusion partner (e.g., GST, MBP, or poly-His) as described above. 

Alternatively the chimeric /ENTP protein may contain the green fluorescent protein, and be 
used to determine the intracellular localization of the /ENTP in the cell. 

Genes Encoding /ENTPs 

30 The present invention contemplates isolation of a gene encoding an /ENTP of the present 
invention, including a full length, or naturally occurring form of /ENTP, and antigenic 
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fragments thereof from any animal, particularly mammalian, and more particularly human, 
source. Such nucleic acids may be used for designing primers for RT-PCR, and for making 
probes that are useful for determining. the expression of iENTP messenger RNA in tissues 
and tumors. Similarly such nucleic acids can be used to determine the expression of iENTP 
5 messenger RNA in normal tissues and tumors by Northern Blot analysis, RNA protection 
assays and the like. As used herein, the term "gene" refers to an assembly of nucleotides that 
encode a polypeptide, and includes cDNA and genomic DNA nucleic acids. 

The present invention provides the primary structure of genes encoding /ENTPs as 
exemplified in Figure 6 and Figure 7. Furthermore, the present invention provides the 
10 genetic information that allows the determination of tissue specific regulatory elements of 
genes encoding the /ENTPs of the present invention. Such regulatory elements may be 
contained in SEQ ID NO:6. 

In accordance with the present invention there may be employed conventional molecular 
biology, microbiology, and recombinant DNA techniques within the skill of the art. Such 

15 techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, 
Molecular Cloning: A Laboratory Manual \ Second Edition (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (herein "Sambrook et aL. 1989"); DNA 
Cloning: A Practical Approach, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide 
Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization [B.D. Hames & S.J. Higgins eds. 

20 (1985)]; Transcription And Translation [B.D. Hames & S.J. Higgins, eds. (1984)]; Animal 
Cell Culture [R.l. Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL Press, 
( 1 986)]; B. Perbal, A Practical Guide To Molecular t Zoning ( 1 984); F.M. Ausubel et al 
(eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. ( 1 994). 

A gene encoding iENTP, whether genomic DNA or cDNA, can be isolated from any source, 
25 particularly from a human cDNA or genomic library. In view and in conjunction with the 
present teachings, methods well known in the art. as described above can be used for 
obtaining iENTP genes from any source (see t e.g., Sambrook et al. y 1989, supra). 

Accordingly, any animal cell or transformed animal cell line potentially can serve as the 
nucleic acid source for the molecular cloning of a iENTP gene. The DNA may be obtained 
30 by standard procedures known in the art from cloned DNA (e.g., a DNA "library"), and 
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preferably is obtained from a cDNA library prepared from tissues with high level expression 
of the protein, by chemical synthesis, by cDNA cloning, or by the cloning of genomic DNA, 
or fragments thereof, purified from the desired cell (See, for example, Sambrook etal, 1989, 
supra; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, Ltd., 
.5 Oxford, U.K. Vol. I, II). Clones derived from genomic DNA may contain regulatory and 
intron DNA regions in addition to coding regions; clones derived from cDNA will not 
contain intron sequences. Whatever the source, the gene should be molecuiarly cloned into a 
suitable vector for propagation of the gene. 

In the molecular cloning of the gene from genomic DNA, DNA fragments are generated, 
10 some of which will encode the desired gene. The DNA may be cleaved at specific sites using 
various restriction enzymes. Alternatively, one may use DNAse in the presence of 
manganese to fragment the DNA, or the DNA can be physically sheared, as for example, by 
sonication. The linear DNA fragments can then be separated according to size by standard 
techniques, including but not limited to, agarose and polyacrylamide gel electrophoresis and 
15 column chromatography. 

Once the DNA fragments are generated, identification of the specific DNA fragment 
containing the desired iENTP gene may be accomplished in a number of ways. For example, 
the generated DNA fragments may be screened by nucleic acid hybridization to a labeled 
probe of the present invention (Benton and Davis, 1977, Science 196:180; Grunstein and 

20 Hogness, 1975, Proc. Natl. Acad. Sci. U.S.A. 72:3961). For example, a set of 

oligonucleotides corresponding to the sequence information provided by the present 
invention can be prepared and used as probes for DNA encoding /ENTP (e.g., in combination 
with a poly-T primer for RT-PCR). Preferably, a probe is selected that is highly unique to 
/ENTP of the invention. Those DNA fragments with substantial homology to the probe will 

25 hybridize. As noted above, the greater the degree of homology, the more stringent 
hybridization conditions can be used. 

Further selection can be carried out on the basis of the properties of the gene, e.g., if the gene 
encodes a protein product having the isoelectric, electrophoretic, amino acid composition, or 
partial amino acid sequence of the /ENTP as disclosed herein. Thus, the presence of the gene 
30 may be detected by assays based on the physical, chemical, or immunological properties of 
its expressed product. For example, cDNA clones, or DNA clones which hybrid-select the 
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proper mRNAs, can be selected which produce a protein that, e.g.^ has similar or identical 
eiectrophoretic migration, isoelectric focusing or non-equilibrium pH gel electrophoresis 
behavior, proteolytic digestion maps, or antigenic properties as known for /ENTP. 



An iENTP gene of the invention can also be identified by mRNA selection, i.e., by nucleic 
5 acid hybridization followed by in vitro translation. In this procedure, nucleotide fragments 
are used to isolate complementary mRNAs by hybridization. Such DNA fragments may 
represent available, purified iENTP DNA, or may be synthetic oligonucleotides designed 
from the partial amino acid sequence information, lmmunoprecipitation analysis or 
functional assays (e.g.* nucleoside transport activity) of the in vitro translation products of 
10 the products of the isolated mRNAs identifies the mRNA and, therefore, the complementary 
DNA fragments, that contain the desired sequences. In addition, specific mRNAs may be 
selected by adsorption of polysomes isolated from cells to immobilized antibodies 
specifically directed against /ENTP. 

A radiolabeled iENTP cDNA can be synthesized using the selected mRNA (from the 
15 adsorbed polysomes) as a template. The radiolabeled mRNA or cDNA may then be used as a 
probe to identify homologous iENTP DNA fragments from among other genomic DNA 
fragments. 



The present invention also relates to cloning vectors containing genes encoding analogs and 
derivatives of /ENTP of the invention, that have the same or homologous functional activity 
20 as /ENTP, and homologs thereof from other species. The production and use of derivatives 
and analogs related to /ENTP are within the scope of the present invention. In a specific 
embodiment, the derivative or analog is functionally active, i.e., capable of exhibiting 
nucleoside transport activity. 

/ENTP derivatives can be made by altering encoding nucleic acid sequences by substitutions, 
25 additions or deletions that provide for functionally equivalent molecules. Preferably, 

derivatives are made that have enhanced or increased functional activity or greater specificity 
with regard to a particular pcrmeant relative to native /ENTP. Alternatively, such derivatives 
may encode soluble fragments of /ENTP extracellular domain that have the same or greater 
affinity for the natural permeants of the /ENTPs of the present invention. Such soluble 
30 derivatives also may be potent inhibitors of the nucleoside transport activity of the /ENTP. 



• 
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Due to the degeneracy of nucleotide coding sequences, other DNA sequences which encode 
substantially the same amino acid sequence as a iENTP gene may be used in the practice of 
the present invention. These include but are not limited to allelic genes, homologous genes 
from other species, and nucleotide sequences comprising all or portions of iENTP genes 

.5 which are altered by the substitution of different codons that encode the same amino acid 
residue within the sequence, thus producing a silent change. Likewise, the /ENTP 
derivatives of the invention include, but are not limited to, those containing, as a primary 
amino acid sequence, all or part of the amino acid sequence of a /ENTP protein including 
altered sequences in which functionally equivalent amino acid residues are substituted for 

1 0 residues within the sequence resulting in a conservative amino acid substitution. Such 

alterations define the term "a conservative substitution" as used herein. For example, one or 
more amino acid residues within the sequence can be substituted by another amino acid of a 
similar polarity, which acts as a functional equivalent, resulting in a silent alteration. 
Substitutes tor an amino acid within the sequence may be selected from other members of the 

1 5 class to which the amino acid belongs. For example, the nonpolar (hydrophobic) amino acids 
include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan and 
methionine. Amino acids containing aromatic ring structures are phenylalanine, tryptophan, 
and tyrosine. The polar neutral amino acids include glycine, serine, threonine, cysteine, 
tyrosine, asparagine, and glutamine. The positively charged (basic) amino acids include 
20 arginine, lysine and histidine. The negatively charged (acidic) amino acids include aspartic 
acid and glutamic acid. Such alterations will not be expected to affect apparent molecular 
weight as determined by polyacrylamide gel electrophoresis, or isoelectric point. 

Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 
25 - Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free NH 2 can be maintained. 

Amino acid substitutions may also be introduced to substitute an amino acid with a 
particularly preferable property. For example, a Cys may be introduced at a potential site for 
30 disulfide bridges with another Cys. Pro may be introduced because of its particularly planar 
structure, which induces p-turns in the protein's structure. 
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The genes encoding /ENTP derivatives and analogs of the invention can be produced by 
various methods known in the art. The manipulations which result in their production can 
occur at the gene or protein level. For example, the cloned iENTP gene sequence can be 
modified by any of numerous strategies known in the art (Sambrook et ah, 1989, supra). The 
5 sequence can be cleaved at appropriate sites with restriction endonuclease(s), followed by 
further enzymatic modification if desired, isolated, and ligated in vitro. In the production of 
the gene encoding a derivative or analog of /ENTP, care should be taken to ensure that the 
modified gene remains within the same translational reading frame as the iENTP gene, 
uninterrupted by translational stop signals, in the gene region where the desired activity is 
1 0 encoded. 



Additionally, the iENTP-encoding nucleic acid sequence can be mutated in vitro or in vivo, 
to create and/or destroy translation, initiation, and/or termination sequences, or to create 
variations in coding regions and/or form new restriction endonuclease sites or destroy 
preexisting ones, to facilitate further in vitro modification. Preferably, such mutations 

1 5 enhance the functional activity or specificity for a particular permeant, of the mutated iENTP 
gene product. Any technique for mutagenesis known in the art can be used, including but not 
limited to, in vitro site-directed mutagenesis (Hutchinson, C, et ai, 1978, J. Biol. Chem. 
253:6551; Zoller and Smith, 1984, DNA 3:479-488; Oiiphant et al, 1986, Gene 44:177; 
Hutchinson et al y 1986, Proc. Natl. Acad. Sci. U.S.A. 83:710), use of TAB® linkers 

20 (Pharmacia), etc. PCR techniques are preferred for site directed mutagenesis (see Higuchi, 
1989, "Using PCR to Engineer DNA", in PCR Technology: Principles and Applications for 
DNA Amplification, H. Erlich, ed., Stockton Press. Chapter 6, pp. 61-70). 

The identified and isolated gene can then be inserted into an appropriate cloning vector. A 
large number of vector-host systems known in the art may be used. Possible vectors include, 

25 but are not limited to, plasm ids or modified viruses, but the vector system must be 

compatible with the host cell used. Examples of vectors include, but are not limited to, E. 
coli y bacteriophages such as lambda derivatives, or plasmids such as pBR322 derivatives or 
pUC plasm id derivatives, e.g., pGEX vectors, pmal-c, pFLAG, etc. The insertion into a 
cloning vector can, for example, be accomplished by ligating the DNA fragment into a 

30 cloning vector which has complementary cohesive termini. However, if the complementary 
restriction sites used to fragment the DNA arc not present in the cloning vector, the ends of 
the DNA molecules may be enzymatically modified. Alternatively, any site desired may be 
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produced by ligating nucleotide sequences (linkers) onto the DNA termini: these li gated 
linkers may comprise specific chemically synthesized oligonucleotides encoding restriction 
endonuclease recognition sequences. Recombinant molecules can be introduced into host 
cells via transduction, transformation, trans feet ion, infection, electroporation. etc., so that 
5 many copies of the gene sequence are generated. Preferably, the cloned gene is contained on 
a shuttle vector plasmid, which provides for expansion in a cloning cell, e.g., E. coli, and 
facile purification for subsequent insertion into an appropriate expression cell line, if such is 
desired. For example, a shuttle vector, which is a vector that can replicate in more than one 
type of organism, can be prepared for replication in both E. coli and Saccharomyces 
10 cerevisiae by linking sequences from an E. coli plasmid with sequences from the yeast 2u 
plasmid. 

In an alternative method, the desired gene may be identified and isolated after insertion into a 
suitable cloning vector in a "shot gun" approach. Enrichment for the desired gene, for 
example, by size fractionation, can be done before insertion into the cloning vector. 

15 Expression of iENTP 

The present invention provides for expressing the nucleic acids which encode the /ENTPs 
active fragments thereof, derivatives or analog thereof, or a functionally active derivative, 
including a chimeric protein, thereof, that has been inserted into an appropriate expression 
vector, /.<?.«, a vector which contains the necessary elements for the transcription and 

20 translation of the inserted protein-coding sequence. Such elements are termed herein a 
"promoter." Thus, the nucleic acid encoding an /ENTP of the invention is operationally 
associated with a promoter in an expression vector of the invention. Both cDNA and 
genomic sequences can be cloned and expressed under control of such regulatory sequences. 
An expression vector also preferably includes a replication origin. One particular use for 

25 such expression vectors is to express an i'ENTP in large quantities that can be used for 
functional and structural studies of the purified transport protein. 

The necessary transcriptional and translational signals can be provided on a recombinant 
expression vector, or they may be supplied by the native gene encoding /ENTP and/or its 
flanking regions. 
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Potential chimeric partners for the /ENTP of the present invention include substitute lectin 
domains, either from naturally occurring multivalent lectin receptors, such as mannose 
receptor of macrophages, natural lectins, or other sources. 

Potential host-vector systems include but are not limited to mammalian cell systems, infected 
5 with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g., 
baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria transformed 
with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression elements of 
vectors vary in their strengths and specificities. Depending on the host-vector system 
utilized, any one of a number of suitable transcription and translation elements may be used. 

10 A recombinant /ENTP protein of the invention, or functional fragment, derivative, chimeric 
construct, or analog thereof, may be expressed chromosomally, after integration of the 
coding sequence by recombination. In this regard, any of a number of amplification systems 
may be used to achieve high levels of stable gene expression (See Sambrook et aL, 1 989, 
supra). 

1 5 The cell containing the recombinant vector comprising the nucleic acid encoding /ENTP is 
cultured in an appropriate cell culture medium under conditions that provide for expression 
of /ENTP by the cell. 

Any of the methods previously described for the insertion of DNA fragments into a cloning 
vector may be used to construct expression vectors containing a gene consisting of 
20 appropriate transcriptional/transJational control signals and the protein coding sequences. 
These methods may include in vitro recombinant DNA and synthetic techniques and in vivo 
recombination (genetic recombination). 

Expression of /ENTP may be controlled by any promoter/enhancer element known in the art, 
but these regulatory elements must be functional in the host selected for expression. 
25 Promoters which may be used to control iENTP gene expression include, those described in 
Example 1 below, as well as the SV40 early promoter region (Benoist and Chambon, 1981, 
Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous sarcoma 
virus (Yamamoto, et al, 1980, Cell 22:787-797). the herpes thymidine kinase promoter 
(Wagner etal., 1981, Proc. Natl. Acad. Sci. U.S.A. 78: 144] -1445), the regulatory sequences 
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of the inetallothionein gene (Brinster et ai, 1982, Nature 296:39-42); prokaryotic expression 
vectors such as the P-lactamase promoter (Villa-Kamaroff, et al, 1978, Proc. Natl. Acad. 
Sci. U.S.A. 75:3727-3731), orthetac promoter (De Boer, et aL, 1983, Proc.Natl. Acad. Sci. 
U.S.A. 80:21-25); sec also "Useful proteins from recombinant bacteria" in Scientific 
5 American, 1980, 242:74-94; promoter elements from yeast or other fungi such as the Gal 4 
promoter, the ADC (alcohol dehydrogenase) promoter. PGK (phosphoglycerol kinase) 
promoter, alkaline phosphatase promoter; and the animal transcriptional control regions, 
which exhibit tissue specificity and have been utilized in transgenic animals: elastase 1 gene 
control region which is active in pancreatic acinar cells (Swift et uL 9 1984, Cell 38:639-646; 

10 Ornitz et al, 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987, 
Hepatology 7:425-515); insulin gene control region which is active in pancreatic beta cells 
(Hanahan, 1985, Nature 315:1 15-122), immunoglobulin gene control region which is active 
in lymphoid cells (Grosschedl etal, 1984, Cell 38:647-658; Adames et a/., 1985, Nature 
318:533-538; Alexander etaL t 1987, Mol. Cell. Biol. 7:1436-1444), mouse mammary tumor 

1 5 virus control region which is active in testicular, breast, lymphoid and mast cells (Leder et 
aL, 1986, Cell 45:485-495), albumin gene control region which is active in liver (Pinkert et 
al, 1987, Genes and Dcvcl. 1 :268-276), alpha-fetoprotein gene control region which is active 
in liver (Krumlauf et al> 1985, Mol. Cell. Biol. 5: 1 639- 1 648: Hammer et al , 1 987, Science 
235:53-58), alpha 1 -antitrypsin gene control region which is active in the liver (Kelsey et ai., 

20 1 987. Genes and Dcvcl. 1:161-171), beta-globin gene control region which is active in 

myeloid cells (Mograrn et al, 1985, Nature 3 15:338-340; Kollias et al, 1986, Ceil 46:89-94), 
myelin basic protein gene control region which is active in oligodendrocyte cells in the brain 
(Readhead et al 9 1987, Cell 48:703-712), myosin light chain-2 gene control region which is 
active in skeletal muscle (Sani, 1985, Nature 3 14:283-286), and gonadotropic releasing 

25 hormone gene control region which is active in the hypothalamus (Mason et ah % 1986, 
Science 234:1372-1378). 



Expression vectors containing a nucleic acid encoding an /ENTP of the invention can be 
identified by four general approaches: (a) PCR amplification of the desired plasmid DNA or 
specific mRNA, (b) nucleic acid hybridization, (c) presence or absence of selection marker 
30 gene functions, and (d) expression of inserted sequences. In the first approach, the nucleic 
acids can be amplified by PCR to provide for detection of the amplified product. In the 
second approach, the presence of a foreign gene inserted in an expression vector can be 
detected by nucleic acid hybridization using probes comprising sequences that are 
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homologous to an inserted marker gene. In the third approach, the recombinant vector/host 
system can be identified and selected based upon the presence or absence of* certain 
"selection marker" gene functions (e.g., P-galaetosidasc activity, thymidine kinase activity, 
resistance to antibiotics, transformation phenotype, occlusion body formation in baculovirus, 
5 etc.) caused by the insertion of foreign genes in the vector. In another example, if the nucleic 
acid encoding /ENTP is inserted within the "selection marker" gene sequence of the vector, 
recombinants containing the iENTP insert can be identified by the absence of the iENTP gene 
function. In the fourth approach, recombinant expression vectors can be identified by 
assaying for the activity, biochemical, or immunological characteristics of the gene product 
10 expressed by the recombinant, provided that the expressed protein assumes a functionally 
active conformation. This last approach has been used in Example I, below. 



A wide variety of host/expression vector combinations may be employed in expressing the 
DMA sequences of this invention. Useful expression vectors, for example, may consist of 
segments of chromosomal, non-chromosomal and synthetic DNA sequences. Suitable 

15 vectors include derivatives of SV40 and known bacterial plasmids, e.g., E. coli plasmids col 
El, pCRl, pBR322, pMal-C2, pET, pGEX (Smith et aL, 1988, Gene 67:3 1-40), pMB9 and 
their derivatives, plasmids such as RP4; phage DNAS, e.g., the numerous derivatives of 
phage k, e.g., NM989, and other phage DNA. e.g., MI 3 and filamentous single stranded 
phage DNA; yeast plasmids such as the 2u plasmid or derivatives thereof; vectors useful in 

20 eukaryotic cells, such as vectors useful in insect or mammalian cells; vectors derived from 
combinations of plasmids and phage DNAs. such as plasmids that have been modified to 
employ phage DNA or other expression control sequences; and the like. 



For example, in a baculovirus expression systems, both non-fusion transfer vectors, such as 
but not limited to pVL941 (BamH\ cloning site; Summers), pVL1393 (BamhW , Sma\ t Xba\, 

25 EcoR\ , M?/I, XmalU, BgM, and Pst\ cloning site; lnvitrogen), pVL1392 (BgM, Pstl, Not\, 
Xmalll, EcoRl, Xbal, Smal, and BarnH] cloning site; Summers and lnvitrogen), and 
pBlue^acIII (BamRltBgM, Pst\ y NcoU and HindlU cloning site, with blue/white 
recombinant screening possible; lnvitrogen), and fusion transfer vectors, such as but not 
limited to pAc700 (BamHl and Kpn\ cloning site, in which the BamH\ recognition site 

30 begins with the initiation codon; Summers), pAc701 and pAc702 (same as pAc700, with 
different reading frames), pAc360 (BamU\ cloning site 36 base pairs downstream of a 
polyhedrin initiation codon: lnvitrogen(195)) r and pBIueBacHisA, B, C (three different 
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reading frames, with BamHI, BgllL Ps1\* Nco\, and Hindlll cloning site, an N-terminal 
peptide for ProBond purification, and blue/white recombinant screening of plaques; 
Invitrogen (220)) can be used. 

Mammalian expression vectors contemplated for use in the invention include vectors with 
5 inducible promoters, such as the dihydrofolate reductase (DHFR) promoter, e.g., any 

expression vector with a DHFR expression vector, or a ZW/Wmethotrexate co-amplification 
vector, such as pED (Psil, Sail, Sbal, Sma\, and EcoKX cloning site, with the vector 
expressing both the cloned gene and DHFR; see Kaufman, Current Protocols in Molecular 
Biology, 16.12 (1991). Alternatively, a glutamine synthetase/methionine sulfoximine co- 

10 amplification vector, such as pEE14 (Hindl\],Xba], SmaL Sbal, EcoRL and Bell cloning site, 
in which the vector expresses glutamine synthase and the cloned gene; Celltech). In another 
embodiment, a vector that directs episomal expression under control of Epstein Barr Virus 
(EBV) can be used, such as pREP4 (BamHI , Sfil, Xhol, Noil, Nhel, HindllL NheU PvuU, and 
Kpnl cloning site, constitutive RSV-LTR promoter, hygromycin selectable marker; 

1 5 Invitrogen), pCEP4 {BamHX , Sfil, Xhol 9 Not\ y NheX, Hindlll, Nhel, Pvull, and Kpnl cloning 
site, constitutive hCMV immediate early gene, hygromycin selectable marker; Invitrogen), 
pMEP4 (Kpnl, Pvu], Nhel, HindlU, Noll, Xhol, Sfil, BamH] cloning site, inducible 
methallothionein I la gene promoter, hygromycin selectable marker: Invitrogen), pREP8 
{BamHI, Xhol, Noil, Hindlll, Nhel, and Kpnl cloning site, RSV-LTR promoter, histidinol 

20 selectable marker; Invitrogen), pREP9 (Kpnl, Nhel, Hindlll, Noil, Xhol, Sfi], and BamHI 

cloning site, RSV-LTR promoter, G41 8 selectable marker, Invitrogen), and pEBVHis (RSV- 
LTR promoter, hygromycin selectable marker, N-terminal peptide puriflable via ProBond 
resin and cleaved by enterokmase; Invitrogen). Selectable mammalian expression vectors for 
use in the invention include pRc/CMV (Hindlll, BstXX, Noil, Sbdl, and Apal cloning site, 

25 G4I8 selection; Invitrogen), pRc/RSV (HindlU, SpeX, BstXX, Noil, Xhal cloning site, G418 
selection; Invitrogen), and others. Vaccinia virus mammalian expression vectors (see, 
Kaufman, 1991 , supra) for use according to the invention include but are not limited to 
pSCl I (Smal cloning site, TK- and (*-gal selection), pMJ601 (Sail, SmaX, AfiU Narl, BspMU, 
BamHI, Apa\, Nhel, Sacll, Kpnl, and Hindlll cloning site; TK- and p-gal selection), and 

30 pTKgptFIS (EcoKl, Pstl, Sail, Accl, Hindi], Sba], BamHI, and Hpa cloning site, TK or 
XPRT selection). 
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Yeast expression systems can also be used according to the invention to express the z'ENTP 
protein. For example, the non-fusion pYES2 vector (Xbal y Sph\, Sho\, Noil. GstX], £coRI, 
BstXl, BamH\, Sad, Kpn\, and HirtMl cloning sit; Invitrogen) or the fusion pYESHisA, B, 
C (XbaU Sphl, ShoL NotU BstXL EcoRl. BamH I , Sacl, Kpnl, and Hirtdlll cloning site, N- 
5 terminal peptide purified with ProBond resin and cleaved with enterokinase; invitrogen), to 
mention just two, can be employed according to the invention. 

In a preferred embodiment the expression vector is pDR2 (Clonetech). In another preferred 
embodiment the expression vector is pcDNA3 (Invitrogen). 

Once a particular recombinant DNA molecule is identified and isolated, several methods 
1 0 known in the art may be used to propagate it. Once a suitable host system and growth 

conditions are established, recombinant expression vectors can be propagated and prepared 
in quantity. As previously explained, the expression vectors which can be used include, but 
are not limited to, the following vectors or their derivatives: human or animal viruses such 
as vaccinia virus or adenovirus; insect viruses such as baculovirus; yeast vectors; 
1 5 bacteriophage vectors (e.g., lambda), and plasm id and cosmid DNA vectors, to name but a 
few. 

In addition, a host cell strain may be chosen which modulates the expression of the inserted 
sequences, or modifies and processes the gene product in the specific fashion desired. 
Different host cells have characteristic and specific mechanisms for the transtational and 

20 post-translational processing and modification (e.g., glycosylation, cleavage [e.g. of signal 
sequence]) of proteins. Appropriate cell lines or host systems can be chosen to ensure the 
desired modification and processing of the foreign protein expressed. For example, 
expression in a bacterial system can be used to produce an nonglycosylated core protein 
product. However, the transmembrane /ENTP expressed in bacteria may not be properly 

25 folded. Expression in yeast can produce a glycosylated product. Expression in eukaryotic 
cells can increase the likelihood of "native" glycosylation and folding of a heterologous 
protein. Moreover, expression in mammalian cells can provide a tool for reconstituting, or 
constituting, i'ENTP activity. Furthermore, different vector/host expression systems may 
affect processing reactions, such as proteolytic cleavages, to a different extent. 
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Vectors are introduced into the desired host cells by methods known in the art, e.g., 
transfection, transduction, electroporation, microinjection, cell fusion, DEAE dextran, 
calcium phosphate precipitation, lipofection (lysosome fusion), use of a gene gun, or a DNA 
vector transporter (see, e.g., Wu et al. y 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 1988, 
5 J. BioJ. Chem. 263: 14621-14624; Hartmut et al., Canadian Patent Application No. 2,012,3 1 1, 
filed March 15, 1990). 



The present invention also provides eel) lines made from cells transfected or transduced with 
the /ENTPs of the present invention. In preferred embodiments of this type the cells are 
mammalian cells. In one such embodiment the /ENTP is introduced into a COS-1 cell with a 

1 0 pcDNA3 expression vector, as exemplified below. The /ENTP expressed in a human cell 
line is preferentially hENT2. In one such embodiment, the human cell expresses hENT2 as 
its only detectable nucleoside transport protein. In a particular embodiment of this type, the 
human cell is the T-cell leukemia cell line CEM, transfected with an Epstein-Barr Nuclear 
Antigen 1 expression cassette with a pDR2 expression vector capable of supporting the 

15 episomal replication of an Epstein-Barr virus- based mammalian expression vector encoding 
the hENT2. Such a transfected cell expresses the /ENTP as its only nucleoside transport 
protein and is therefore a valuable tool for further characterization of the /ENTPs of the 
present invention. Heretofore, such characterization appeared to be an impossible task, as all 
known cell lines expressing an iENTP had also been shown to have one or more additional 

20 nucleoside transport activities. Such a cell line also is a valuable tool for determining 
whether antiviral and antitumor nucleoside analogs can enter cells via the ei transporter 
and/or for identifying specific inhibitors of /ENTPs as discussed below. In one particular 
embodiment of this type, the cell is an CEM/N1-7 cell exemplified below. 



Such a cell line also can be used in expression cloning of proteins using an episomally 
25 replicating Epstein-Barr virus based vector that requires the expression of EBNA-1 in trans, 
and requires nucleoside transport in the selection process (e.g. HAT selection {with adenine 
substituted for hypoxanthine since the line is HPRT deficient)). Such procedures and cell 
lines are especially useful for proteins that are expressed in larger quantities in a T-ccll 
background. 



30 



General Protein Purification Procedures : 
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The initial step for purifying the /ENTPs of the present invention, active fragments thereof, 
and related tagged or fusion proteins generally consists of lysing the cells containing the 
/ENTPs. Cell lysis can be achieved by a number of methods including through the use of a 
physical means such as a French press, a sonicator, or a blender; or through chemical means 
5 including enzymatic extractions (with for example, lysozyme or pancreatine and/or organic 
extractions or solubilizations with detergents, such as sodium dodecyl sulfate (SDS), Triton 
X-100, nonidet P-40 (NP-40), digoxin, sodium deoxycholate, and the like, including mixtures 
thereof; or through a combination of chemical and physical means. For example, 
solubilization can be enhanced by sonication of the suspension. Subsequent steps of 

10 purification include salting in or salting out, such as in ammonium sulfate fractionations; 
solvent exclusion fractionations, e.g^ an ethanol precipitation; detergent extractions to free 
the membrane bound lENTPs of the present invention using such detergents as Triton X-100, 
Tween-20 etc.; or high salt extractions. Solubilization of proteins may also be achieved 
using aprotic solvents such as dimethyl sulfoxide and hexamethylphosphoramide. In 

1 5 addition, high speed ultracentrifugation may be used either alone or in conjunction with other 
extraction techniques. 



Generally good secondary isolation or purification steps include solid phase absorption using 
calcium phosphate gel or hydroxyapatite; or solid phase binding. Solid phase binding may 
be performed through ionic bonding, with either an anion exchanger, such as 

20 diethylaminoethyl (DEAE), or diethyl [2- hydroxy propyl] aminoethyl (QAE) SEPHADEX or 
cellulose; or with a cation exchanger such as carboxymethyl (CM) or sulfopropyl (SP) 
SEPHADEX or cellulose. Alternative means of solid phase binding includes the exploitation 
of hydrophobic interactions e.g., the using of a solid support such as PHENYLSEPHAROSE 
and a high salt buffer; affinity- binding, using, e.g., placing a nucleoside or nucleoside analog 

25 on to an activated support; immuno-binding, using e.g, an antibody to an /"ENTP of the 
present invention bound to an activated support; as well as other solid phase supports 
including those that contain specific dyes or lectins etc. A further solid phase support 
technique that is often used at the end of die purification procedure relies on size exclusion, 
such as SEPHADEX and SEPHAROSE gels, or pressurized or centrifugal membrane 

30 techniques, using size exclusion membrane filters. 



Solid phase support separations are generally performed batch- wise with low-speed 
centrifugations or by column chromatography. High performance liquid chromatography 
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(HPLC), including such related techniques as FPLC, is presently the most common means of 
performing liquid chromatography. Size exclusion techniques may also be accomplished 
with the aid of low speed ccntrifugation. 

In addition size permeation techniques such as gel electrophoretic techniques may be 
5 employed. These techniques are generally performed in tubes, slabs or by capillary 
electrophoresis. 

Almost all steps involving protein purification employ a buffered solution. Unless otherwise 
specified, generally 25-100 mM concentrations are used. Low concentration buffers 
generally infer 5-25 mM concentrations. High concentration buffers generally infer 

1 0 concentrations of the buffering agent of between 0.1 -2M concentrations. Typical buffers can 
be purchased from most biochemical catalogues and include the classical buffers such as 
Tris, pyrophosphate, monophosphate and diphosphate. The Good buffers [Good, et aL y 
Biochemistry, 5:467 (1966); Good ei ah Meih. EnzymoL, 24: Part B, 53 (1972) ; and 
Fergunson, et. al Anal. Biochem. 104:300,(1980)] such as Mcs, Hepes, Mops, tricine and 

15 Ches. 

Materials to perform all of these techniques are available from a variety of sources such as 
Sigma Chemical Company in St. Louis, Missouri. 

Antibodies to /EM TPs 
According to the invention, an /ENTP obtained from a natural source or produced 

20 recombinantly or by chemical synthesis, and fragments or other derivatives or analogs 

thereof, including fusion proteins, may be used as an immunogen to generate antibodies that 
recognize the /ENTP polypeptide. Such antibodies include but are not limited to polyclonal, 
monoclonal, chimeric, single chain, Fab fragments, and an Fab expression library. The anti- 
/ENTP antibodies of the invention may be cross reactive, e.g., they may recognize an /ENTP 

25 from different species. Polyclonal antibodies have greater likelihood of cross reactivity. 
Alternatively, an antibody of the invention may be specific for a single form of the /ENTP, 
such as murine /ENTP. Preferably, such an antibody is specific for human /ENTP. 

Various procedures known in the art may be used for the production of polyclonal antibodies 
to an /ENTP of the present invention or derivative or analog thereof. For the production of 
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antibody, various host animals can be immunized by injection with an /ENTP or a derivative 
(e.g., fragment or fusion protein) thereof, including but not limited to rabbits, mice, rats, 
sheep, goats, etc. In one embodiment, an /ENTP or fragment thereof can be conjugated to an 
immunogenic carrier, e.g., bovine serum albumin (BSA) or keyhole limpet hemocyanin 
5 (KLH). Various adjuvants may he used to increase the immunological response, depending 
on the host species, including but not limited to Freund's (complete and incomplete), mineral 
gels such as aluminum hydroxide, surface active substances such as lyso lecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and 
potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and 
1 0 Coryne bacterium parvum. 

For preparation of monoclonal antibodies directed toward an /ENTP of the present invention, 
or fragment, analog, or derivative thereof, any technique that provides for the production of 
antibody molecules by continuous cell lines in culture may be used. These include but are 
not limited to the hybridoma technique originally developed by Kohler and Milstein [Nature 

15 256:495-497 (1975)], as well as the trioma technique, the human B-cell hybridoma technique 
[Kozbor et al, Immunology Today 4:72 1983); Cote et al. 9 Proc. Natl Acad. Sci. U.S.A. 
80:2026-2030 (1983)J, and the EBV-hybridoma technique to produce human monoclonal 
antibodies [Cole ei al, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., 
pp. 77-96 (1985)]. In an additional embodiment of the invention, monoclonal antibodies can 

20 be produced in germ-free animals utilizing recent technology [PCT/US90/02545]. In fact, 
according to the invention, techniques developed for the production of "chimeric antibodies" 
[Morrison et aL, J. Bacteriol. 159:870 (1984); Neuberger et al 9 Nature 312:604-608 (1984); 
Takeda et aL, Nature 314:452-454 (1 985)] by splicing the genes from a mouse antibody 
molecule specific for an /ENTP, for example, together with genes from a human antibody 

25 molecule of appropriate biological activity can be used; such antibodies are within the scope 
of this invention. Such human or humanized chimeric antibodies are preferred for use in 
therapy of human diseases or disorders (described infra), since the human or humanized 
antibodies are much less likely than xenogenic antibodies to induce an immune response, in 
particular an allergic response, themselves. 

30 According to the invention, techniques described tor the production of single chain 

antibodies [U.S. Patent Nos. 5,476,786 and 5,132,405 to Huston; U.S. Patent 4,946,778] can 
be adapted to produce /ENTP-specific single chain antibodies. An additional embodiment of 
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the invention utilizes the techniques described for the construction of Fab expression libraries 
fHuse et a!.. Science 246:1275-1281 (1989)] to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity for an /ENTP or its derivatives, or 
analogs. 

5 Antibody fragments which contain the idiotype of the antibody molecule can be generated by 
known techniques. For example, such fragments include but are not limited to: the F(ab') 2 
fragment which can be produced by pepsin digestion of the antibody molecule; the Fab' 
fragments which can be generated by reducing the disulfide bridges of the F(ab') 2 fragment, 
and the Fab fragments which can be generated by treating the antibody molecule with papain 
1 0 and a reducing agent. 

In the production of antibodies, screening for the desired antibody can be accomplished by 
techniques known in the art, e.g., radioimmunoassay, ELISA (enzyme- linked immunosorbant 
assay), "sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitin 
reactions, immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or 

15 radioisotope labels, for example), western blots, precipitation reactions, agglutination assays 
(e.g., gel agglutination assays, hemagglutination assays), complement fixation assays, 
immunofluorescence assays, protein A assays, and Immunoelectrophoresis assays, etc. In 
one embodiment, antibody binding is detected by detecting a label on the primary antibody. 
In another embodiment, the primary antibody is detected by detecting binding of a secondary 

20 antibody or reagent to the primary antibody. In a further embodiment, the secondary 
antibody is labeled. Many means are known in the art for detecting binding in an 
immunoassay and are within the scope of the present invention. For example, to select 
antibodies which recognize a specific epitope of an /ENTP, for example the predicted 
extracellular loop, amino acids 35-64 of SEQ IDNO:2, one may assay generated hybridomas 

25 for a product which binds to an /ENTP fragment containing such epitope. For selection of an 
antibody specific to an /ENTP from a particular species of animal, one can select on the basis 
of positive binding with an /ENTP expressed by or isolated from cells of that species of 
animal. 

Hie foregoing antibodies can be used in methods known in the art relating to the localization 
30 and activity of the /ENTP, e.g., for Western blotting, imaging /ENTP in siiu, measuring 

levels thereof in appropriate physiological samples, etc. using any of the detection techniques 
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mentioned above or known in the art. More particularly, the antibodies of the present 
invention can be used in flow cytometry studies, in immunohistochemical staining, and in 
immunoprecipitation which serves to aid the determination of the level of expression of an 
/ENTP in a tumor or normal cell or tissue. 

5 In a specific embodiment, antibodies that agonize or antagonize the activity of an /ENTP can 
be generated. Such antibodies can be tested using the assays described infra for identifying 
ligands. 

Labels 

Suitable labels include enzymes and proteins such as green fluorescent protein, fluorophores 
10 (e.g., fluorescein isothiocyanate (FITC), phycoerythrin (PE), Texas red (TR), rhodamine, 
free or chelated lanthanide series salts, especially Eu 3+ , to name a few fluorophores), 
chromophores, radioisotopes, chelating agents, dyes, colloidal gold, latex particles, ligands 
(e.g., biotin), and chemiluminescent agents. When a control marker is employed, the same or 
different labels may be used for the receptor and control marker. 

15 In the instance where a radioactive label, such as the isotopes 3 H, ,4 C, 32 P, 35 S, 36 CI, 3, Cr, 57 Co, 
58 Co, 59 Fe, 9a Y, l25 i. 131 1, and ,86 Re are used, known currently available counting procedures 
may be utilized. In the instance where the label is an enzyme, detection may be 
accomplished by any of the presently utilized colorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric or gasometric techniques known in the art. 

20 Direct labels are one example of labels which can be used according to the present invention. 
A direct label has been defined as an entity, which in its natural state, is readily visible, either 
to the naked eye, or with the aid of an optical filter and/or applied stimulation, e.g. U.V. light 
to promote fluorescence. Among examples of colored labels, which can be used according to 
the present invention, include metallic sol particles, for example, gold sol particles such as 

25 those described by Leuvering (U.S. Patent 4,3 13,734); dye sole particles such as described by 
Gribnau et al (U.S. Patent 4,373,932) and May et al (WO 88/08534); dyed latex such as 
described by May, supra, Snyder (EP-A 0 280 559 and 0 281 327); or dyes encapsulated in 
liposomes as described by Campbell et al (U.S. Patent 4,703,017). Other direct labels 
include a radtonucleotide, a fluorescent moiety or a luminescent moiety. In addition to these 
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direct labelling devices, indirect labels comprising enzymes can also be used according to the 
present invention. Various types of enzyme linked immunoassays are well known in the art, 
for example, alkaline phosphatase and horseradish peroxidase, lysozyme, glucoses- 
phosphate dehydrogenase, lactate dehydrogenase, urease, these and others have been 
5 discussed in detail by Eva Engvall in Enzyme Immunoassay ELISA and EMIT in Methods in 
Enzymology, 70. 419-439, 1980 and in U.S. Patent 4,857,453. 

Suitable enzymes include, but are not limited to, alkaline phosphatase and horseradish 
peroxidase. 

Other labels for use in the invention include magnetic beads or magnetic resonance imaging 
1 0 labels. 

In another embodiment, a phosphorylation site can be created on an antibody of the invention 
for labeling with 32 P, e.g., as described in European Patent No. 0372707 (application No. 
893 1 1 108.8) by Sidney Pestka, or U.S. Patent No. 5,459,240, issued October 1 7. 1995 to 
Foxwell et ah 

1 5 As exemplified herein, proteins, including the /ENTPs of the present invention and 

antibodies thereto, can be labeled by metabolic labeling. Metabolic labeling occurs during in 
vitro incubation of the cells that express the protein in die presence of culture medium 
supplemented with a metabolic label, such as f 35 S]-methionine or [ 32 P]-orthophosphate. In 
addition to metabolic (or biosynthetic) labeling with [ 35 S]-methionine 3 the invention further 

20 contemplates labeling with f l4 CJ-amino acids and pHj-amino acids (with the tritium 
substituted at non- labile positions). 

Solid Supports 

A solid phase support for use in the present invention will be inert to the reaction conditions 
for binding. A solid phase support for use in the present invention must have reactive groups 
25 in order to attach a binding partner, such as an oligonucleotide encoding an /ENTP, an 

/ENTP, or an antibody to an /ENTP, or for attaching a linker or handle which can serve as the 
initial binding point for any of the foregoing. In another embodiment, the solid phase 
support may be a useful chromatographic support, such as the carbohydrate polymers 




WO 98/46749 PCT/US98/07283 

47 

SEPHAROSE, SEPHADEX, and agarose. As used herein,- a solid phase support is not 
limited to a specific type of support. Rather a large number of supports are available and are 
known to any person having skill in the art. Solid phase supports include silica gels, resins, 
derivatized plastic films, glass beads, cotton, plastic beads, alumina gels, magnetic beads, 
5 membranes (including but not limited to nitrocellulose, cellulose, nylon, and glass wool), 
plastic and glass dishes or wells, etc. For example, solid phase supports used for peptide or 
oligonucleotide synthesis can be used, such as polystyrene resin (e.g., PAM-resin obtained 
from Bachem Inc., Peninsula Laboratories, etc.), POLYH1PE® resin (obtained from 
Aminotech, Canada), poly amide resin (obtained from Peninsula Laboratories), polystyrene 
10 resin grafted with polyethylene glycol (TentaGel®, Rapp Polymere, Tubingen, Germany) or 
polydimethylacrylamide resin (obtained from Milligen/Biosearch, California). In synthesis 
of oligonucleotides, a silica based solid phase support may be preferred. Silica based solid 
phase supports are commercially available {e.g., from Peninsula Laboratories, Inc.; and 
Applied Biosystems, Inc.). 

15 Identification of Ligands for the iENTPs 

Identification and isolation of a gene encoding an /ENTP of the present invention provides 
for expression of i'ENTP in quantities greater than can be isolated from natural sources, or in 
indicator cells that are specially engineered to indicate the activity of (ENTP expressed after 
transfection or transduction of the cells. Accordingly, in addition to rational design of 

20 permeants and/or inhibitors based on the structure of /ENTP, the present invention 

contemplates an alternative method for identifying specific ligands (including permeants 
and/or inhibitors and the like) of an iENTP using various screening assays known in the art. 

Any screening technique known in the art can be used to screen for ligands to an (ENTP. 
The present invention contemplates screens for small molecule ligands or ligand analogs and 

25 mimics, as well as screens for natural ligands that bind to /ENTP and its activity. Inhibitors 
can include analogues of lidoflazine, mioflazine, and draflazine and the like [Griffith et aL, 
Biochem. Pharmacol., 40:2297-2303 (1 990); Baer et aL, Naunyn Schmiedebergs Arch 
Pharmacol. 343:365-369 (1991); Pirovano et al, Eur. J. Pharmacol. Mol Pharmacol, 
189:419-422 (1990); Pirovano et al y Nucleosides Nucleotides, 10:1177-1 179 (1991); 

30 Kruidering et al y Nucleosides Nucleotides, 10:1223-1224 (1991); Hammond, J. Pharmacol. 
Exp.Ther. 259:799-807 (1991 ); Van Belle et al., Nucleosides Nucleotides, 10:975-982 
(1991)]. Natural products libraries also can be screened using assays of the invention for 
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potential ligands to /ENTP. In addition, a large number of nucleoside analogues have been 
identified in the art and can be used in such screens. 

Knowledge of the primary sequence of the /ENTPs of the present invention., and the 
similarity of that sequence with proteins of known function, can provide an initial clue as to 
5 new ligands of the protein. Identification and screening of ligands is further facilitated by 
determining structural features of the /ENTP, e.g., using X-ray crystallography, neutron 
diffraction, nuclear magnetic resonance spectrometry*, and other techniques for structure 
determination. These techniques provide for the rational design or identification of ligands. 

Another approach uses recombinant bacteriophage to produce large libraries. Using the 
1 0 "phage method" [Scott and Smith, 1 990, Science 249:386-390 ( 1 990); Cwirla. ct al, Proc. 
Natl. Acad. Set, 87:6378-6382 (J 990); Devlin et al. Science, 249:404-406 (1990)J, very 
large libraries can be constructed ( 1 0 6 - 1 0 s chemical entities). A second approach uses 
primarily chemical methods, of which the Geysen method [Geysen et al, Molecular 
Immunology 23:709-71 5 (1986); Geysen et al. J. Immunologic Method 102:259-274 (1987)] 
1 5 and the method of Fodor et al. [Science 251 :76 7-773 ( 1 99 1 )] are examples. Furka et al. 

[14th International Congress of Biochemistry, Volume 5, Abstract FR:013 (1988); Furka, Int. 
J. Peptide Protein Res. 37:487-493 (1991)], Houghton [U.S. Patent No. 4,63K21 1, issued 
December 1986] and Rutter et al [U.S. Patent No. 5,010,175, issued April 23, 1991] describe 
methods to produce a mixture of peptides that can be tested as agonists or antagonists. 



20 In another aspect, synthetic libraries [Needels et al, Proc. Natl. Acad. Set. USA 90: 1 0700-4 
( 1 993); Ohlmeyer et al, Proc. Natl. Acad. Sci. USA 90: 1 0922- 1 0926 ( 1 993): Lam et al, 
International Patent Publication No. WO 92/00252; Kocis et al, International Patent 
Publication No. WO 9428028, each of which is incorporated herein by reference in its 
entirety], and the like can be used to screen for /ENTP ligands according to the present 

25 invention. 



The screening can be performed with recombinant cells that express the /ENTP, or 
alternatively, using purified protein, e.g., produced rccombinantly, as described above or 
from natural sources. For example, the ability of labeled, soluble or solubilized /ENTP or 
fragment thereof that includes the permeant-binding portion of the molecule, to bind the 
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permeant or inhibitor thereto, can be used to screen libraries, as described in the foregoing 
references. 



One such procedure comprises contacting a potential ligand with the isolated /ENTP under 
physiological conditions and detecting whether the potential ligand binds to the /ENTP. A 
5 potential ligand is selected as a ligand if it binds to the /ENTP. The binding can be detected 
with any of the standard protein-ligand binding assays known in the art as exemplified 
below. Either the iENTP or the ligand can be appropriately labeled as described above. 
Similarly, either the /ENTP or ligand can be attached to a solid support. Once a ligand is 
identified, its dissociation constant can be determined. The ligand is selected when the 
10 dissociation constant is less than 10" 5 M. 



The present invention also provides specific methods of identifying a permeant of an /ENTP 
using a recombinant cell that expresses the /ENTP, />., a cell transfected or transduce with 
/ENTP. In one such method, a potential permeant is contacted with a transfected or 
transduced cell in which all detectable nucleoside transport activity is performed by the 

1 5 /ENTP. The nucleoside transport of the potential permeant is evaluated in the transfected or 
transduced cell. The potential permeant is identified as a permeant when the transport of the 
potential permeant in the transfected or transduced cell is determined to follow a facilitated 
diffusion process. Potential permeants can include antiviral nucleoside analogs, antitumor 
nucleoside analogs, or natural nucleosides. In preferred embodiments of this type, the 

20 transfected or transduced cell is a transfected or transduced human cell. 



The present invention further provides specific methods for selecting a drug that inhibits an 
/ENTP using a recombinant cell that expresses the /ENTP as described above. In one such 
method, a potential drug is contacted with a transfected or transduced cell in which all 
detectable nucleoside transport activity is performed by the /ENTP. The nucleoside transport 

25 activity of the cell is evaluated. A potential drug is selected as a drug when a decrease in the 
nucleoside transport activity is determined relative to that determined when the evaluating 
was performed in the absence of the potential drug. The nucleoside transport activity of the 
transfected or transduced cell can be evaluated from any number of ways including: as a 
function of the determination of the trans-stimulation of a permeant; as a function of cell 

30 toxicity; as a function of the determination of the direct transport of a permeant. as a function 
of the determination of the countertransport of a permeant; as a function of the toxicity of a 
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nucleoside analog which is a permeant of the /ENTP, such as tubercidin, 2-chloro-2'- 
deoxyadenosine or Ara-C (wherein a Iwo-fold change or greater is considered significant); or 
as a function of two or more of these determinations. 

A drug can also be selected using the /ENTP-transfected or transduced cell described above 
5 by detecting the mutual inhibition (i.e. competition) of nucleoside transport in the transfected 
or transduced cell between a potential drug and a known permeant of the /ENTP (such as 
adenosine). A potential drug is selected as a drug when mutual inhibition is detected. The 
inhibition may be measured by any known means including those described above. 

Transgenic Vectors and Gene Therapy 
1 0 The functional activity of iENTP can be evaluated transgenically. In this respect, a 

transgenic animal model can be used [Archibald et al.. Int. Pat. Publ. WO90/05188; Hurwitz 
etal. y Int. Pat. Publ. WO93/03164; Bleck et al., U.S. Patent 5,530,177 issued June 25, 1996; 
Drohan et al U.S. Patent 5,589,604]. The iENTP gene can be used in complementation 
studies employing a transgenic mouse for example. Transgenic vectors, including viral 
1 5 vectors, or cosmid clones (or phage clones) corresponding to the wild type locus of candidate 
gene, can be constructed using the isolated iENTP gene. Cosmids may be introduced into 
transgenic mice using published procedures [Jaenisch, Science, 240: 1468- 1474 (1988)]. In a 
genetic sense, the transgene acts as a suppressor mutation. 

Alternatively, a transgenic animal model can be prepared in which expression of the iENTP 
20 gene is disrupted. Gene expression is disrupted, according to the invention, when no 

functional protein is expressed. One such method for preparing a such a "knockout mouse" 
is detailed by Capecchi et al.^ in U.S. Patent 5,464.764. A standard procedure for evaluating 
the phenotypic effect of a gene product is to employ knock-out technology to delete the gene. 
Alternatively, recombinant techniques can be used to introduce mutations, such as nonsense 
25 and amber mutations, or mutations thai lead to expression of an inactive protein. 

The present invention also extends to the preparation of antisense nucleotides and ribozymes 
that may be used to interfere with the expression of the /ENTPs at the translational level. 
This approach utilizes antisense nucleic acid and ribozymes to block translation of a specific 
mRNA, either by masking that mRNA with an antisense nucleic acid or cleaving it with a 
30 ribozyme. 
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Antisense nucleic acids are DNA or RNA molecules that are complementary to at least a 
portion of a specific mRNA molecule [see Marcus- Sekura, Anal. Biochem. 1 72:298 (1988)]. 
In the cell, they hybridize to that mRNA, forming a double stranded molecule. The cell does 
not translate an mRNA in this double-stranded form. Therefore, antisense nucleic acids 
5 . interfere with the expression of mRNA into protein. Oligomers of about fifteen nucleotides 
and molecules that hybridize to the AUG initiation codon will be particularly efficient, since 
they are easy to synthesize arid are likely to pose fewer problems than larger molecules when 
introducing them into organ cells. Antisense methods have been used to inhibit the 
expression of many genes in vitro [Marcus- Sekura, 1 988, supra; Hambor et ah, J. Exp. Med. 
10 168:1237 (1988)]. Preferably synthetic antisense nucleotides contain phosphoester analogs, 
such as phosphoroth folates, or thioesters, rather than natural phosphoester bonds. Such 
phosphoester bond analogs are more resistant to degradation, increasing the stability, and 
therefore the efficacy, of the antisense nucleic acids. 

1 5 Ribozymes are RNA molecules possessing the ability to specifically cleave other single 
stranded RNA molecules in a manner somewhat analogous to DNA restriction 
endonucleases. Ribozymes were discovered from the observation that certain mRNAs have 
the ability to excise their own introns. By modifying the nucleotide sequence of these RNAs, 
researchers have been able to engineer molecules that recognize specific nucleotide 

20 sequences in an RNA molecule and cleave it [Cech, J. Am. Med. Assoc. 260:3030 (1988)]. 
Because they are sequence-specific, only mRNAs with particular sequences are inactivated. 

Investigators have identified two types of ribozymes, Tetrahymena-Xypz and 
"hammerhead "-type. Tetrahymena-type ribozymes recognize four-base sequences, while 
"hammerhead"-type recognize eleven- to eighteen-base sequences. The longer the 
25 recognition sequence, the more likely it is to occur exclusively in the target mRNA species. 
Therefore, hammerhead-type ribozymes are preferable to Tetrahymena-iypz ribozymes for 
inactivating a specific mRNA species, and eighteen base recognition sequences are 
preferable to shorter recognition sequences. 

The DNA sequences encoding the /ENTPs described and enabled herein may thus be used to 
30 prepare antisense molecules against and ribozymes that cleave mRNAs for the /ENTPs, thus 
inhibiting a cell from expressing the gene encoding the /ENTP, thereby hindering or 
curtailing the nucleoside transport of a specific nucleoside or nucleoside analog into the cell. 
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In one embodiment, a gene encoding an /ENTP or active fragment thereof is introduced in 
vivo in a viral vector. Such vectors include an attenuated or defective DNA virus, such as but 
not limited to herpes simplex virus (HSV), papillomavirus, Epstein Ban* virus (EBV), 
adenovirus, adeno-associated virus (AAV), and the like. Defective viruses, which entirely or 
5 . almost entirely lack viral genes, are preferred. Defective virus is not infective after 

introduction into a cell. Use of defective viral vectors allows for administration to cells in a 
specific, localized area, without concern that the vector can infect other cells. Examples of 
particular vectors include, but are not limited to. a defective herpes virus 1 (HSV 1 ) vector 
[Kaplitt etal^ Molec. Cell. Neurosci. 2:320-330 (1991)]. an attenuated adenovirus vector, 
10 such as the vector described by Strattbrd-Perricaudet et aL [J. Clin. Invest. 90:626-630 
(1992)], and a defective adeno-associatcd vims vector [Samulski et aL. J. Virol. 61:3096- 
3101 (1987): Samulski etaL.J. Virol. 63:3822-3828(1989)]. 

Preferably, for in vitro administration, an appropriate immunosuppressive treatment is 
employed in conjunction with the viral vector, adenovirus vector, to avoid immuno- 
1 5 deactivation of the viral vector and transduced cells. For example, immunosuppressive 

cytokines, such as intcrleukin-12 (IL-12), interferon-y (IFN-y), or anti-CD4 antibody, can be 
administered to block humoral or cellular immune responses to the viral vectors [see, e.g. y 
Wilson, Nature Medicine (1995)]. In addition, it is advantageous to employ a viral vector 
that is engineered to express a minimal number of antigens. 

20 In another embodiment the gene can be introduced in a retroviral vector, e.g.* as described in 
Anderson et aL, U.S. Patent No. 5,399,346; Mann etaL, 1983, Cell 33:153;Temin etal, 
U.S. Patent No. 4,650,764; Temin et aL, U.S. Patent No. 4,980,289; Markowitz et aL, 1988, 
J. Virol. 62:1 120; Temin et al„ U.S. Patent No. 5,124,263: International Patent Publication 
No. WO 95/07358, published March 16, 1995, by Dougherty etaL; and Kuo et aL, 1993, 

25 Blood 82:845. 

Targeted gene delivery is described in International Patent Publication WO 95/28494, 
published October 1995. 



30 



Alternatively, the vector can be introduced in vivo by lipofection. For the past decade, there 
has been increasing use of liposomes for encapsulation and transfection of nucleic acids in 
vitro. Synthetic cationic lipids designed to limit the difficulties and dangers encountered 
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with liposome mediated transfection can be used to prepare liposomes for in vivo transfection 
of a gene encoding a marker [Feigner, et. al., Proc. Natl Acad. Set U.S.A. 84:7413-7417 
(l987);.?£eMackey, etal.^Proc. Natl Acad. Set. U.S.A. 85:8027-8031 (1988)]. The use of 
cation ic lipids may promote encapsulation of negatively charged nucleic acids, and also 
5 promote fusion with negatively charged cell membranes [Feigner and Ringold, Science 

337:387-388 (1989)]. The use of lipofection to introduce exogenous genes into the specific 
organs in vivo has certain practical advantages. Molecular targeting of liposomes to specific 
cells represents one area of benefit. It is clear that directing transfection to particular cell 
types would be particularly advantageous in a tissue with cellular heterogeneity, such as 
10 pancreas, liver, kidney, and the brain. Lipids may be chemically coupled to other molecules 
for the purpose of targeting [see Mackey, et. al., supra]. Targeted peptides, e.g., hormones or 
neurotransmitters, and proteins such as antibodies, or non-peptide molecules could be 
coupled to liposomes chemically. 

It is also possible to introduce the vector in vivo as a naked DMA plasmid. Naked DMA 
1 5 vectors for gene therapy can be introduced into the desired host cells by methods known in 
the art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, DEAE 
dextran, calcium phosphate precipitation, use of a gene gun, or use of a DMA vector 
transporter [see, e.g., Wu et al, J. Biol. Chem. 267:963-967 (1992); Wu and Wu, J. BioL 
Chem. 263:14621-14624 (1988); Hartmut et al. y Canadian Patent Application No. 2,012,31 1, 
20 filed March 15, 1990]. 

In a preferred embodiment of the present invention, a gene therapy vector as described above 
employs a transcription control sequence operably associated with the sequence for the 
/ENTP inserted in the vector. That is, a specific expression vector of the present invention 
can be used in gene therapy. 

25 Such an expression vector is particularly useful to regulate expression of a therapeutic iENTP 
gene. In one embodiment, the present invention contemplates constitutive expression of the 
iENTP gene, even if at low levels. 

The present invention provides a method of using such expression vectors in cancer 
chemotherapy. Transducing normal hematopoietic stem cells ex vivo, with cDNA encoding 
30 the NBMPR-insensitive equilibrativc nucleoside transport protein, would offer protection to 
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the transduced cells in vivo, during antimetabolite therapy that is coupled with the 
administration of NBMPR, a potent inhibitor of the es nucleoside transporter. The 
transduced cells, which express the /ENTP, would survive the NBMPR and antimetabolite 
therapy through the /ENTP-dependent salvage of nucleosides from exogenous pools. In 
5 contrast untransduced cells and tumor cells which do no express the zENTP, would be 
selectively diminished as the antimetabolite blocks de novo nucleoside biosynthesis, and 
NBMPR prevents nucleoside salvage via the es transporter. 

The dosage of NBMPR to be administered can be empirically determined with initial studies 
in mouse models, and then in higher mammals. These are the type of studies that have 

1 0 become routine for those having skill in the art of chemotherapy. For example, normal mice 
can be treated by intraperitoneal administration with varying doses of methotrexate (MTX; 
20 mg/kg- 200 mg/kg) for five consecutive days, to determine cytotoxic effects on both 
normal myeloid progenitor cells and hematopoietic stem cells with this antimetabolite. Other 
nucleoside antimetabolites such as PALA, 5-fluorouracil, AraC, and AZT, can be similarly 

1 5 tested (as required) at clinically relevant dosages. The marrow from drug treated and 

untreated mice are examined for myeloid progenitor cell numbers to determine the degree of 
cytotoxic effects. 

NBMPR (0.1 ^M to 10 fuM) and draflazine (0. 1 to 10 /zM) can be added in conjunction 
with the antimetabolite (at concentrations found to be cytotoxic in the above study) to murine 
20 bone marrow cells in an in vitro culture system optimized using medium supplemented with 
hematopoietic growth factors (1L-3 at 20 ng/ml, human IL-6 at 50 ng/ml, rat SCF at 50 
ng/ml) and fetal bovine serum. The effect of NBMPR or draflazine on potentiating the 
activity of the antimetabolites are assessed by cell count measurements over a 24 hour period 
after treatment. 

25 Hematopoietic stem cells can be transduced with either viral constructs (e.g., retrovirus, 
adeno-associated virus or lenti virus) containing a nucleic acid encoding an /ENTP (e.g. y 
hENT2) or a control nucleic acid, e.g., encoding MDR1 [Pastan et aL, Proc. Nat. Acad. Sci. 
85:4486 (1988)]. Mice can then be transplanted with either hENT2 or MDR1 transduced 
cells e.g., the cells may be used as donors for bone marrow transplantation [ Torok-Storb et 

30 al 9 Bone Marrow Transplant. 14: Suppl 4: S71-S73 (1994); Allay et aL. Blood 88:645a 
(1996); PCT Application US/96/1 7660, filed November 4, 1 996 designating the United 
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States, entitled "In Vivo Selection of Primitive Hematopoietic Cells" having Sorrentino etal. 
as the Inventors, incorporated herein by reference in its entirety.] The mice are next 
subjected to antimetabolite treatment in the presence or absence of a nucleoside transport 
inhibitor (e.g., NBMPR or draflazine) over a period of about 1 0 or more days, and at the 
5 . concentrations determined above. The in vivo enrichment of myeloid progenitors is assessed 
by comparing marrow cell ulari ties between untreated control mice, transduced mice treated 
with the nucleoside transport inhibitor, and transduced mice not treated with nucleoside 
transport inhibitor. 

In a related aspect, the present invention provides a method of overcoming a major limitation 
10 to successful hematopoietic cell-directed gene therapy. This major limitation is inefficient 
gene transfer into repopulating stem cells. The method is based on the selection of cells that 
express a desired heterologous gene due to linkage of the heterologous gene to a gene that 
encodes an /ENTP. 

Effective in vivo enrichment of transduced cells requires the elimination of unmodified 
15 hematopoietic cells. This can be effected by the administration of an antimetabolite such as 
trimetrexate (TMTX) in the presence of an es transporter inhibitor that preferentially inhibits 
the es transporter relative to the ;ENTP ? (e.g., NBMPR). In the absence of expression of an 
/ENTP, such a regimen has been shown to be toxic to hematopoietic progenitor cells both in 
vitro and in vivo [Allay et ai. Blood 88:Supp. 1 3 645a ( 1 996)]. In contrast, cells expressing 
20 /ENTP can rely on the /ENTP-dependent nucleoside salvage pathway in the presence of the 
inhibitors to de novo nucleoside synthesis and the es transporter-dependent nucleoside 
salvage pathway as described above. Therefore, linking the heterologous gene with the 
expression of the /ENTP will ensure that surviving hematopoietic cells contain the desired 
heterologous gene when the cells are treated with drugs such as TMTX and NBMPR as 
25 described above. 

Various therapeutic heterologous genes can also be inserted in a gene therapy vector of the 
invention such as but not limited to adenosine deaminase (ADA) to treat severe combined 
immunodeficiency (SCID); marker genes or lymphokine genes into tumor infiltrating (TIL) 
T cells [Kasis ct al, Proc. Nad. Acad. ScL U.S.A. 87:473 (1990); Culver et al. 9 ibid. 88:3155 
30 (1991)]; genes for clotting factors such as Factor VT11 and Factor IX for treating hemophilia 
[Dwarki et al Proc. Natl Acad. Set. USA, 92:1023-1027 (19950); Thompson, Thromh. and 
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Haemostatis, 66:1 19-122 (1991)]; and various other well known therapeutic genes such as, 
but not limited to. P-globin, dystrophin, insulin, erythropoietin, growth hormone, 
glucocerebrosidase, (^-glucuronidase, cc-antitrypsin, phenylalanine hydroxylase, tyrosine 
hydroxylase, ornithine transcarbamylase, apo lipoproteins, and the like. In general, see U.S. 
5 . Patent Mo. 5,399,346 to Anderson et al 

In another aspect, the present invention provides for regulated expression of the heterologous 
gene in concert with the expression of the /ENTP. The present invention provides for co- 
expression of i'ENTP and a therapeutic heterologous gene under control of a specific DNA 
recognition sequence by providing a gene therapy expression vector comprising both a 

10 iENTP coding gene and a gene under control of, inter alia., the /ENTP regulatory sequence. 
Concerted control of such heterologous genes may be particularly useful in the context of 
treatment for proliferative disorders^ such as tumors and cancers, when the heterologous gene 
encodes a targeting marker or immunomodulatory cytokine that enhances targeting of the 
tumor cell by host immune system mechanisms. Examples of such heterologous genes for 

15 immunomodulatory (or immuno-effector) molecules include, but are not limited to, 

interferon-ot, interferon-y, interferon- p, intcrferon-to, interferon-T, tumor necrosis factor-a, 
tumor necrosis factor-p, interleukin-2, interleukin-7, interleukin-12, interleukin-15, B7-1 T 
cell co-stimulatory molecule, B7-2 T cell co-stimulatory molecule, immune cell adhesion 
molecule (ICAM) -1 T cell co- stimulatory molecule, granulocyte colony stimulatory factor, 

20 granulocyte-macrophage colony stimulatory factor, and combinations thereof. 

The present invention may be better understood by reference to the following non-limiting 
Examples, which are provided as exemplary of the invention. These examples are presented 
in order to more fully illustrate the preferred embodiments of the invention. They should in 
no way be construed, however, as limiting the broad scope of the invention. 

25 EXAMPLE I 

Molecular Cloning of the Equilibrati ve. Nitrohenzvlmercaptopurineriboside fNBMPRV 
lnsensitive Nucleoside Transporter ei: a Delayed Early Response Gene 



30 



Introduction 

Mammalian cells obtain nucleic acid precursors through the de novo synthesis of nucleotides 
and the salvage of exogenous nucleobases and nucleosides. The first step in the salvage of 
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nucleosides is their transport across the plasma membrane. Several nucleoside transport 
activities, including both equilibrative and concentrative mechanisms, have been identified 
by their functional properties. Until recently, however, little has been known about the 
proteins that mediate these transport processes. Two of the concentrative transporters have 

5 now been cloned, and just recently the equilibrative NBM PR-sensitive transporter es was 
cloned (Griffiths et aL, Nature Med. 3:89-93. 1997). The protein mediating equilibrative 
NBMPR- insensitive transport (ei), however, has remained elusive. The cloning of a 2522 
basepair (bp) cDNA from HeLa cells that encodes a functional ei transport protein is 
disclosed in this example. This cDNA was cloned by complementation of a defect in 

10 nucleoside transport in subline of CEM human leukemia cells. The cDNA encodes a 456- 
residue protein with 10 to II predicted membrane-spanning regions. Stable expression of 
this cDNA in nucleoside transport-deficient CEM cells, as well as transient expression in 
COS cells, conferred equilibrative, NBM PR- in sensitive transport activity to the cells. 

The predicted protein is highly homologous (50% identity, 69% similarity) to the recently 
15 cloned human NBMPR-sensitive equilibrative nucleoside transporter hENTl , and thus has 
been designated hENT2. Surprisingly, the carboxy terminal portion of the predicted hENT2 
protein is nearly identical to a 326 residue predicted peptide (hHNP36) in the Genbank 
database that has been identified as a growth factor-induced "delayed early response" gene of 
unknown function. 

20 The transient transfection studies with full length hENT2 and a 5' truncated construct that 
lacks the first start codon (predicted protein 99% identical to hHP36) demonstrated that a 
functional nucleoside transport protein is not produced from the second start codon. These 
data suggest that the hHNP36 protein is a truncated, non-functional form of hENT2. Since 
the hHNP36 cDNA was originally cloned as a delayed early response gene (derl2), hENT2 

25 may be a delayed early response gene, and nucleoside transport may play an important role 
in the proliferative response when quiescent cells are stimulated into cycle by growth factors. 

Materials and Methods 
Cells and Growth Conditions. HeLa S3 cells and COS-1 cells from the American Type 
Culture Collection were grown at 37°C in a humidified air and 5% C0 3 atmosphere in 
30 Dulbecco's Modified Eagle Medium (D-MEM). For HeLa S3 cells the medium was 

supplemented with 10% heat- inactivated fetal calf serum and for COS- 1 cells with 5% heal- 



WO 98/46749 PCT/US98/07283 

58 

inactivated fetal calf serum plus 5% NuSerum IV (Collaborative Research Products). The 
CEM cell lines were grown as stationary suspension cultures at 37°C in a humidified air and 
5% C0 2 atmosphere in RPMI 1640 medium supplemented with 10% heat- inactivated horse 
serum and the following additions: CEM/AraC-8C, 0.25 |iM tubercidin [4-amino-7-P-D- 
5 . ribofuranosylpyrolof2,3-d|pyrimidinej /0.5 u.M AraC [l-p-D-arabinofuranosylcytosine]; 
CEM/C-19 cells, 0.25 uM tubercidin/0.5 uM AraC/50 ug/ml G418; N-l-7, 20 uM DUP-785 
[6-fluoro-2(2 ? — fluoro-1, 1 '-biphenyl-4-yl)-3-rnethyI-4-quinoline carboxylic acid sodium 
salt]/100 fiM uridine [Dexter et al Cancer Res. 45:5563 (1985)]. The nucleoside transport 
deficient cell line CEM/AraC-8C [Ullman etal^J. Biol. Chem. 263:12391-12396 (1988)] 
10 was provided by Dr. Buddy Ullman, Oregon Health Sciences University, Portland Or. DUP- 
785 was provided by the National Cancer Institute. 

Isolation of C-19 cells. A mixture containing CEM/AraC-8C cells (2x1 0 7 ) and 10 fig of a 
plasm id mixture consisting of Sea 1 -restricted pCMVEBNA (Clontech) and Sea 1 -restricted 
pRSVneo at a 20: 1 molar ratio was electroporated at 190 V at a capacitance of 960 p.F 
1 5 utilizing a Gene Pulsar obtained from BioRad. After a 48 hour of recovery, the cells were 
selected in medium containing 200 |Jg/ml of G4I8 (Geneticin, obtained from Life 
Technologies) for 25 days. Surviving cells were cloned by plating in soft agarose (0.35%) 
containing 200 ng/ml G418. After 21 days colonies were transferred to liquid culture and 
grown for characterization. 

20 Expression cloning, A Clontech HeLa S3 cell c-DNA library in the pDR2 vector was 

transfected into C-19 cells (2x10*) by electroporation as described above. In bulk culture, the 
electroporated cells were sequentially selected in medium containing 200 ug/ml hygromycin 
B (14 days), 20 uM DUP-785/100 u=M uridine (27 days) and finally 20 uM DUP-785/100 
uM uridine/1 uM NBMPR [(nitrobenzylmercaptopurineriboside), 6-[(4-nitrobenzyl)thio-9-p- 

25 D-ribofuranosyl purine] (14 days). Plasm ids were extracted (utilizing a Ql A-prep Spin 
Plasmid Miniprep Kit, Qiagen) from cells surviving selection and, subsequently used to 
transform electrocompetent WM 1 100 E. coli cells (obtained from BioRad). Plasm ids from 
individual E. coli colonies were analyzed by restriction digestion with BamHl plus Xbal. 
Plasmids containing inserts were individually reintroduced into CEM/C19 cells which were 

30 then selected as before. 
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Sequencing of the pDR2/N171 insert. Both strands of the insert were sequenced to a level of 
3 to 7-fold redundancy by Taq DycDcoxy terminator cycle sequencing on an automated 
Model 373 A DNA Sequencer, (obtained from Applied Biosystems). 

Northern analysis. PolyA+-RNA was isolated from the indicated cell lines using the 
5 FastTrack 2.0 kit from Invitrogen. The RNA (2 ug/lane) was separated on a formaldehyde 
reducing 1% agarose gel and transferred to charged nylon membranes (obtained from 
Hybond-N, Amersham Corp.). A BamlTl/Nhe] cDNA fragment (1.8 kb) encompassing 
nucleotides 393-21 83 was gel purified and labeled with J2 P-dCTP using the Primelt kit from 
Strategene. Hybridization was carried out for 16 hours at 42° C in 50% formamide 
1 0 containing 1 0% dextran sulfate. The blot was washed at high stringency (0.2 X SSPE at 65 ° 
C) and analyzed using a Phosphorlmager and ImageQuant software. A multiple human 
tissue blot (obtained from Clontech) with 2 u,g of polyA+ RN A/lane was also probed under 
identical conditions. 

Transient Expression in COS- J cells: The N 1-71 clone contained a 1368 bp open reading 
1 5 frame with two potential start codons. The full orf plus the 3 '-untranslated region were 

excised from pDR2/Nl-71 in a Bgll/Xbal fragment (bp 233-2605) and directionally cloned 
into the multicloning site of pcDNA3 (Invitrogen) to give pcDNA3/NI 71orfl . Likewise, the 
second start site and 3 '-untranslated region were excised in a Bglll/Xbal fragment (bp 1 104- 
2605) and ligated into pcDNA3 to give pcDNA3/Nl 71 orf2. These constructs were 
20 transfected into COS- 1 cells by the DEAE-dextran method and uptake of uridine determined 
as described by Fang et al. [Biochem. X 317:457-465 (1996)]. 

Results 

Cloning Strategy. Since an NBMPR-msensitive equilibrative nucleoside transport protein 
had not previously been identified, and there were no antibodies or affinity probes available, 

25 a cloning strategy based on the functional expression of ei transport activity in a nucleoside 
transport deficient cell line was devised. A nucleoside transport deficient subline of the 
human T-cell leukemia CEM [Ullmaii et aL, J.BioLChem. 263:12391-12396(1988)] was 
transfected with an EBNA- 1 (Epstein-Barr Nuclear Antigen 1 ) expression cassette to produce 
a transport deficient cell line capable of supporting the episomal replication of the EBV- 

30 based mammalian expression vector pDR2. This cell line, designated CEM/C19, had a stable 
transfection frequency with pDR2 of approximately 10 2 . which was four orders of magnitude 
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greater than that of the parental EBNA-negative cell line. CEM/C19 cells were sensitive to 
the de novo uridylate synthesis inhibitor DUP-785, but could not be rescued from DUP-785 
toxicity by uridine because of their transport defect. In contrast, transport competent CEM 
cells were readily rescued from DUP-785 toxicity by 100 uM uridine. CEM/CI9 cells were 
5 transfected with a pDR2 human cDNA library from HeLa cells, which express es and ei 
transport activities [Dahlig-Harley el al, BiochemJ. 200:295-305 (1981)] but do not have 
any detectable sodium-dependent nucleoside transport activity. Since CEM/C19 cells do not 
clone well in soft agar, batch cultures of transfected cells were subjected to sequential 
selection as follows: (1) hygromycin to select for transfected cells, (2) DUP-785 plus uridine 

1 0 to select for cells expressing any nucleoside transporter (all known transporters accept 
uridine as a permeant), and (3) DUP-785 plus uridine and NBMPR to select for cells 
expressing NBMPR-insensitive uridine transport activity. Surviving cell cultures were 
screened for NBMPR-insensitive uridine uptake, and plasmids extracted from positive 
cultures and rescued back into E. colL Plasmids isolated from individual £. coli colonies 

15 were analyzed for the presence of inserts and reintroduced into CEM/C-1 9 cells. The cells 
were then selected for expression of NBMPR- insensitive uridine transport activity as before. 
A plasmid (pDR2/Nl-7) containing a 2.6 Kb insert was identified that permitted ceil survival 
upon reintroduction and selection. The cell line was designated CEM/N 1-7 and used for 
further functional characterization of the plasmid. 



20 Uridine Transport in CEM/N 1-7 Cells. To confirm the presence of uridine transport activity 
in the transfected cells, uridine influx was compared in CEM/NI-7 and CEM/C19 cells 
(Figure 1 A). CEM/NI-7 cells displayed a large component of uridine influx that was at least 
10-fold greater than that of the CEM/C19 cells. Addition of a large excess of unlabeled 
uridine blocked uridine transport in the transfected cells, indicating the presence of a 

25 saturable carrier mediated process; but had no effect on uridine uptake in CEM/C19 cells, 
suggesting that the slow rate of uptake was due to simple diffusion. Removal of sodium 
from the buffer had no effect on transport in CEM/N 1-7 indicating the presence of an 
equilibrative type transporter. Addition of NBMPR at a concentration of 0.1 uM, which is 
sufficient to block es mediated transport, had no effect on transport in CEM/N 1 -7 cells 

30 (Figure 1 B) suggesting that the cDNA insert encoded the NBMPR-insensitive equilibrative 
nucleoside transporter ei. Partial inhibition of transport at higher concentrations of NBMPR 
(Figure 1 B) was also consistent with the described properties of the human ei transporter in 
HeLa cells where the JC 50 values (concentration producing a 50% inhibition) for inhibition of 
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uridine transport via es and ei are 1 nM and 6 u,M, respectively [Dahlig-Harley et aL, (1981), 
supra]. Also consistent with the ei transporter of HeLa cells [Dahlig-Harley et aL, (198 1 ), 
supra] , transport in CEM/N1-7 cells was completely blocked by 10 uM DIP [dipyridamole, 
or bis(diethanolamino>4,8-dipiperidinopyrimido-[5,4-d]-2,6-pyrimidine] (Figure IB). As 
5 expected from the permeant selectivity observed for ei activity in several cell lines 

[Plagemann ei aL, Biochim. Biophys. Ada, 947:405-444; Cass, Nucleoside Transport in N.H. 
Gergopapadakou (ed.). Drug Transport in Antimicrobial Therapy and Anticancer Therapy, 
pp. 403-451, New York: Marcel Dekker (1995); Griffeth and Jarvis, 1996, supra], uridine 
influx in CEM/NI-7 cells was inhibited by both purine and pyrimidine nucleosides (Figure 

10 1 C), but not by the corresponding nucleotides. Interestingly, uridine transport in CEM/N 1 -7 
cells was inhibited by the nucleobase hypoxanthine, but not by the other nucleobases tested 
(Figure 1C). While ei activity is generally considered a nucleoside transporter, previous 
studies have suggested that it may also transport hypoxanthine (reviewed in [Griffith and 
Jarvis, 1996, supra]). This was recently confirmed by Jarvis and colleagues [Osses et aL, 

15 Biochem, J., 317:843-848 (1996)] by direct measurements of hypoxanthine in human 
vascular endothelial cells. In summary, the uridine transport activity demonstrated in 
CEM/N 1-7 cells was Na + -independent, inhibited by physiological nucleosides, such as 
hypoxanthine and dipyridamole, but relatively insensitive to inhibition by NBMPR. All 
these features are consistent with those of an ei transporter, i.e., an z'ENTP. 

20 Rescue and analysis of the plasmid. Plasm ids were extracted from CEM/N 1 -7 cells and 
rescued back into E. coli. Restriction analysis of the plasmids from individual E. coli 
colonies demonstrated the presence of a single plasmid containing a 2.6 Kb insert which 
appeared to be identical to the plasmid initially introduced into CEM/C1 9 cells to create the 
CEM/N 1-7 cell line. The insert from one of the recovered cloned plasmids (designated Nl- 

25 71) was sequenced. The Nl-71 cDNA was 2522 basepairs and contained a 1368 basepair 
open reading frame that encodes a 456 residue protein with 10 to 11 predicted membrane 
spanning regions (Figure 2). 

The Nl-71 protein exhibits 50% identity (69% similarity) to the recently cloned human 
NBMPR sensitive nucleoside transport protein hENTI, Figure 2 [Griffiths et aL Nature Med. 
30 3:89-93 (1997)], and thus has been designated hENT2, as a member of this family of 
proteins. As noted previously members of this family can also be found in yeast and 
nematodes [Griffiths et aL (1997), supra]. Analysis of the aligned sequences shown in 
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Figure 2 predict 10 transmembrane domains for this family of proteins, although analysis of 
either hENTI [Griffiths ct al. (1997), supra] or hENT2 alone predict 1 1 transmembrane 
domains. As seen with other membrane transporter families* the most highly conserved 
regions of the ENT proteins fall in the transmembrane domains. All members of the family 
5 . have an extracellular loop, with an >J-glycosyIation site between transmembrane domains 1 
and 2 in nine of the ten reported family members. The length of the extracellular loop is 
variable, and there is very little conservation of sequence within the loop except for the "N- 
glycosylation site. 

Surprisingly, the carboxy terminal portion of the predicted hENT2 protein is also nearly 
10 identical to a 326 amino acid residue predicted peptide (hrfNP36) in the Gcnbank database 
that has been identified as a growth factor- induced "delayed early response*' gene of 
unknown function [Williams et al, Biochem. Biophys. Res. Comm., 213:325-333 (1995)]. 
Inspection of the hHNP36 nucleotide sequence revealed two potential open reading frames 
with hHNP36 translated from the second start codon. While hENT2 also has two potential 
15 start codons, they are within the same open reading frame. The full length cDNA of hHNP36 
(2281 bp) is nearly identical to hENT2, but contains a 68 bp deletion beginning at position 
338. This deletion shifts initial reading frames relative to hENT2 and would result in a 
truncated 22 Kd protein with only 51% identity to the hENT2 protein. Transient transfection 
studies with full length hENT2 and a 5'-truncated construct that lacks the first start codon 
20 (predicted protein 99% identical to hHNP36 (Figure 3)) demonstrated that a functional 

nucleoside transport protein is not produced from the second start codon. These data suggest 
that the hHNP36 protein is a truncated, non-functional form of hENT2. 

Expression ofhENT2 in human cell lines and tissues. Northern blots of po!yA+ RNA from 
HeLa, parental CEM cells, the transport deficient recipient cell line CEM/C1 9, and the stable 

25 transfectant CEM/N 1 -7 were probed at high stringency using a 32 P labeled BamHl/Nhel 
fragment of N 1-7 1 that encompasses 90% of the coding region of hENT2. As shown in 
Figure 4 A, a single transcript of approximately 2.6 Kb was identified in Hel .,a cells, the cell 
line from which the cDNA library was derived. A slightly larger transcript (approximately 
3.0 Kb) was observed in the stable transfectant CEM/N1-7. The larger size of the message in 

30 CEM/N1-7 cells can be accounted for by the fact that in these cells the message is derived 
from transcriptional start and- termination sites of the pDR2 vector, which adds 
approximately 460 nucleotides to the insert. These data indicate that N 1 -7 1 represents the 



WO 98/46749 PCT/US98/07283 

63 

full length cDNA for the ei transporter. As shown in Figure 4A. no message was detected in 
the transport deficient recipient cell line CEM/C19. 

The tissue distribution of the ei transporter was examined using human multiple tissue blots 
from CLONTECH (Fig. 4B). As expected from previous functional studies of ei transport 
5 activity in cultured cell lines, ENT2 was found to be expressed in a number of tissues, and 
the level of expression was variable among tissues. A message of about 2.6 kbp was detected 
in most tissues, but a larger transcript (*=4 kbp) was also observed in thymus, prostate, heart, 
brain, lung, skeletal muscle, and pancreas. The highest level of ENT2 expression was in 
skeletal muscle, with the 2.6-kbp message predominating. The high level of expression in 

10 skeletal muscle was unexpected, as this tissue is composed of nondividing, terminally 
differentiated cells. It is possible, however, that the ENT2 transporter plays a role in the 
efflux of inosine and hypoxanthinc from muscle cells during the net degradation of purine 
nucleotides that occurs during strenuous exercise and/or in the re- uptake of these purines 
during the recovery process [Arabadjis et al t Am. J. Physiol, 264:C1246-C1251 (1993); 

1 5 Norman el al, Clin. Physiol, 7:503-5 10 ( 1 987)]. 



EXAMPLE 2 



Protection of hematopoietic stem cells with a cDNA encoding a NBMPR-i nsensitive 
eauilibrative nucleoside tr ansport (t'ENTP^ for use in Chemotherapy: 

Introduction 

20 Antimetabolites such as trimetrexate, methotrexate, PALA and 5-fluorouracil are commonly 
used in the clinical treatment neoplastic disorders, including cancers. These drugs were 
designed to block de novo nucleotide synthesis, and thereby prevent the proliferation of the 
otherwise rapidly replicating tumor cells. One common problem associated with such 
treatments arises when the targeted tumor cells circumvent the cytotoxic effects of the de 

25 novo synthesis inhibitor by acquiring purine and pyrimidine nucleosides from exogenous 
pools through a nucleoside salvage pathway. In fact, tumor cells commonly express 
significant levels of nucleoside transporters. 

Furthermore, nucleoside analogs such as cytosine arabinoside (Ara-C), 2-Chloro-2*- 
deoxyadenosine, AZT, and ddl are commonly used to treat viral and neoplastic disorders 
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including AIDS and cancers. Nucleoside transporters are involved in the uptake and efflux 
of these drugs by cells. Thus the administration of NBMPR following the cytotoxic 
nucleoside may enhance the activity of these drugs by blocking their exit from the cell and 
prolonging the exposure of the cellular targets to the drug and its active metabolites. 

5 The major form of nucleoside transport seen in 9/9 leukemia, 4/4 rhabdomyosarcoma and 4/4 
colon carcinoma cell lines is performed by an NBMPR-sensitive, equilibrative (es) 
transporter [Belt et al. Advan, Enzyme ReguL, 33:235-252 (1993)]. Therefore, 
supplementing the antimetabolite regimen with NBMPR serves to potentiate the desired 
cytotoxic effects in the tumor cells. Unfortunately, the es transporter is also the major 

10 nucleoside transporter in normal bone marrow cells [Belt et al. Advan. Enzyme ReguL, 
33:235-252 ( 1 993)], and thus administering NBMPR with the antimetabolites also 
potentiates an undesired cytotoxic effect in normal myeloid progenitors in bone marrow 
cells. Described herein, is a method of clinically treating viral and neoplastic disorders, 
including cancers and AIDS, which allows for the potentiation of the desired cytotoxic 

1 5 effects of antimetabolites in tumor cells by co-administering NBMPR, which also protects 
the hematopoietic stem cells from the undesired cytotoxic effects of such treatment. 

Materials and Methods 

Treatment of normal myeloid progenitor cells with antimetabolites to determine resistance. 
Normal mice are treated by intraperitoneal administration with varying doses of methotrexate 

20 (MTX; 20 mg/kg- 200 mg/kg) for five consecutive days., to determine cytotoxic effects on 

normal myeloid progenitor cells and hematopoietic stem cells with this antimetabolite. Other 
nucleoside antimetabolites such as trimetrexate, PALA, 5-fluorouracil, AraC, and AZT, are 
similarly tested (as required) at clinically relevant dosages. The marrow from drug treated 
and untreated mice are examined for myeloid progenitor cell numbers to determine the 

25 degree of cytotoxic effects. 

The use of NBMPR or draflazine to potentiate the activity of antimetabolite in vitro. NBMPR 
(0. 1 fjM to 10 ^M) and/or draflazine (0. 1 ptM to 1 0 //M) are added in conjunction with (or 
directly before or after) an antimetabolite to murine bone marrow cells in an in vitro culture 
system optimized using medium supplemented with hematopoietic growth factors (IL-3 at 20 
30 ng/ml, human IL-6 at 50 ng/ml, rat SCF at 50 ng/ml) and fetal bovine serum. The 

concentration of antimetabolite used is that found to be cytotoxic in the above study. The 
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effect of NBMPR or draflazine on potentiating the activity of the antimetabolites are 
assessed by cell count measurements over a 24 hour period after treatment. 

In vivo selection of myeloid progenitor cells using hEN'12 cDNA. Hematopoietic stem cells 
are transduced with either viral constructs (retrovirus, ad e no- associated virus or lenti virus) 
5 containing a nucleic acid encoding an /ENTP (e.g., HENT2) or a control nucleic acid 

(e.g., encoding MDR1). Mice are then transplanted with either hENT2 or MDR I transduced 
cells e.g., the cells may be used as donors for bone marrow transplantation [ Torok-Storb et 
al. y Bone Marrow Transplant. 14: Suppl 4: S71-S73 (1994); Allay et al, Blood 88:645a 
(1996); PCT Application, US/96/17660, filed November 4, 1996 designating the United 

10 States, entitled "In Vivo Selection of Primitive Hematopoietic Cells" having Sorrentino et al 
as the Inventors, incorporated herein by reference in its entirety]. The mice are next 
subjected to antimetabolite treatment in the presence of a nucleoside transport inhibitor (the 
antimetabolite and the nucleoside transport inhibitor also may be administered separately) or 
in the absence of a nucleoside transport inhibitor, e.g., NBMPR or draflazine, over a period 

1 5 of about J 0 days or more, and at the concentrations determined above. The in vivo 

enrichment of myeloid progenitors is assessed by comparing marrow cellularities between 
untreated control mice, transduced mice treated with the nucleoside transport inhibitor, and 
transduced mice not treated with nucleoside transporter inhibitor. 

Results 

20 The solution presented herein relies on the transduction of normal hematopoietic stem cells 
ex vivo with a cDNA encoding an /ENTP. Such a transduced cell when transplanted 
expresses /ENTP in vivo and is thereby protected from the antimetabolite/NBMPR treatment 
by the /ENTP-dependent nucleoside salvage pathway. In contrast, untransduced cells and 
tumor cells which do not express /ENTP are selected against, due to the blockage of both 

25 their de novo nucleoside biosynthesis and their NBMPR-sensitive nucleoside salvage 

pathway by the nucleoside antimetabolite/NBMPR treatment. In the case of the nucleoside 
analogs, the transduced hematopoietic stem cells are protected by the efflux of the nucleoside 
analogs, via the /ENTP-dependent transport pathway. Untransduced cells and tumor cells are 
selected against due to the retention of the nucleoside analog and its active metabolites in the 

30 cells when the NBMPR-sensitive transport pathway is blocked by NBMPR. 
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The dosage of NBMPR to be administered is empirically determined by studies in a murine 
models first, as described Materials and Methods, and subsequently in higher mammals and 
humans. The use of a nucleoside transporter to provide protection in hematopoietic stem 
cells provides for the use of any antimetabolite, and in particular, methotrexate, trimetrexate, 
5 PALA, 2CDA, ddC, and 5-fluorouracil which have already been used in the clinic for the 
treatment of neoplastic and vjral disorders. 

Figure 5 is a schematic depicting the in vivo results following the ex vivo transduction of 
normal CD34 positive hematopoietic stem cells with viral vectors containing a nucleic acid 
encoding hENT2. Re-transplantation of the transduced cells is followed by intraperitoneal ly 

10 administering an antimetabolite (MTX, trimetrexate, 5-FU or PALA) together with a 
nucleoside transport inhibitor (e.g., NBMPR or draflazine). The combination of the 
antimetabolite and the nucleoside transport inhibitor selectively enriches for the hENT2 
transduced cells which are resistant to both NBMPR and draflazine. These transduced cells 
are thereby protected from the antimetabolite/NBMPR regimen by the NBMPR-insensitive 

1 5 hENT2-dependent nucleoside salvage pathway which transports purine and pyrimidine 
nucleosides from exogenous pools through the plasma membrane. 

EXAMPLE 3 

Isolation of a BAC containing the /EN TP gene 

An /ENTP cDNA was used to screen a BAC library (Genome Systems, Inc.). Of the three 
20 clones identified, one clone contained both the 5' and 3' - UTR (SEQ ID NO:5 and SEQ ID 
NO: 10) as determined by Southern Analysis and PCR. 

PGR sequencing of the BAC using /ENTP primers identified 1 1 introns from the 5' UTR to 3' 
UTR nucleic acid. A BAC-specific primer for /ENTP was used to identify a 2.4 kilobase 
sequence 5 prime to the cDNA (SEQ ID NO:6). In all, the /ENTP gene was found to have at 
25 least 1 1 kilobases with 12 exons and 1 1 introns (Figure 6). 
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EXAMPLE 4 



Alternative splice sjres 

An alternative splice variant was identified in Thymus cells. A nucleic acid was identified 
having the nucleotide sequence of SEQ ID NO:9. A nucleotide sequence of SEQ ID NO: 7 
5 was then deduced which has the amino acid sequence ©f SEQ ID NO:8. A schematic 

drawing of the alternatively spliced nucleic acid is shown in Figure 7. The identification of 
this alternative splice site indicates that there is tissue specific regulation of /ENTP 
expression with alternative splicing. 

While the invention has been described and illustrated herein by references to the specific 
10 embodiments, various specific material, procedures and examples, it is understood that the 
invention is not restricted to the particular material combinations of material, and procedures 
selected for that purpose. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing 
description and the accompanying figures. Such modifications are intended to fall within the 
scope of tine appended claims. 

It is further to be understood that all base sizes or amino acid sizes, and all molecular weight 
or molecular mass values, given for nucleic acids or polypeptides are approximate, and are 
provided for description. 



Various publications, patent applications and patents are cited herein, the disclosures of 
which are incorporated by reference in their entireties. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Belt, Judith A. 

Crawford, Charles R. 
Patel, Divyen 

(ii) TITLE OF INVENTION: A NITROBENZYLMERCAPTOPURINERIBOS IDE 

(NBMPR) -INSENSITIVE, EQUILIBRATIVE , NUCLEOSIDE TRANSPORT 
PROTEIN, NUCLEIC ACIDS ENCODING THE SAME AND METHODS OF 
USE 

(iii) NUMBER OF SEQUENCES: 22 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : David A. Jackson, Esq. 

(B) STREET: 411 Hackensack Ave, Continental Plaza, 4th 

Floor 

(C) CITY: Hackensack 

(D) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 07601 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(B> COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 
{B) FILING DATE: 
(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Jackson Esq., David A. 

(B) REGISTRATION NUMBER: 26,742 

(C) REFERENCE/DOCKET NUMBER: 1340-1-013 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-487-5800 

(B) TELEFAX: 201-343-16B4 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



-<xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 

GGATCCGCGG TGGCGCGACC CTCTGTCCCC GCCTCGGGGC GGAGCCCAGG TCCCAGCCTG 60 

CGGAGCGCGA GACACGCCGA AATCCGCCCG AGGCTACCTG TGCGACTCCA GCCGCCCTGC 12 0 

ACCGGAATCT GGGGAGACCC GCCCCCCGCC CCACCGGTCT GCGGCCCTCC GCCCCAGCGC 18 0 

AGGTGCAGGT GCGGCTTCTC TGCCCCTTTC ACCCCAGGCG CATCCGCCGC GGCGGCCATG 24 0 

GCGCGAGGAG ACGCCCCGCG GGACAGCTAC CACCTGGTCG GGATCAGCTT CTTCATCCTG 300 

GGGCTGGGCA CCCTCCTTCC CTGGAACTTC TTCATCACCG CCATCCCGTA CTTCCAGGCG 360 

CGACTGGCCG GGGCCGGCAA CAGCACAGCC AGGATCCTGA GCACCAACCA CACGGGTCCC 420 

GAGGATGCCT TCAACTTCAA CAATTGGGTG ACGCTGCTGT CCCAGCTGCC CCTGCTGCTC 4 80 

TTCACCCTCC TCAACTCCTT CCTGTACCAG TGCGTCCCGG AGACGGTGCG CATTCTGGGC 54 0 

AGCCTGCTGG CCATACTGCT GCTCTTTGCC CTGACAGCAG CGCTGGTCAA GGTGGACATG 6 00 

AGCCCCGGAC CCTTCTTCTC CATCACCATG GCCTCCGTCT GCTTCATCAA CTCCTTCAGT 66 0 

G CAGTCCT AC AGGGCAGCCT CTTCGGGCAG CTGGGCACCA TGCCCTCCAC CTACAGCACC 72 0 

CTCTTCCTCA GCGGCCAGGG CCTGGCTGGG AT CTTTG CTG CCCTTGCCAT GCTCCTGTCC 78 0 

ATGGCCAGTG GCGTGGACGC CGAGACCTCT GCCCTGGGGT ACTTTATCAC GCCCTGTGTG 84 0 

GGCATCCTCA TGTCCATCGT GTGTTACCTG AGCCTGCCTC ACCTGAAGTT TGCCCGCTAC 900 

TACCTGGCCA ATAAATCATC CCAGGCCCAA GCTCAGGAGC TGGAGACCAA AGCTG AG CTC 96 0 

CTCCAGTCTG ATGAGAACGG GATTCCCAGT AGTCCCCAGA AAGTAGCTCT GACCCTGGAT 1020 

CTTGACCTGG AGAAGGAGCC GGAATCAGAG CCAGATGAGC CCCAGAAGCC AGGAAAACCT 1080 

TCAGTCTTCA CTGTCTTCCA GAAGATCTGG CTGACAGCGC TGTGCCTTGT GTTGGTCTTC 114 0 

ACAGTCACCC TGTCCGTCTT CCCCGCCATC ACAGCCATGG TGACCAGCTC CACCAGTCCT 12 00 

GGGAAGTGGA GTCAGTTCTT CAACCCCATC TGCTGCTTCC TCCTCTTCAA CATCATGGAC 12 60 

TGGCTGGGAC GGAGCCTGAC CTCTTACTTC CTGTGGCCAG ACGAGGACAG CCGGCTGCTG 1320 

CCCCTGCTGG TCTGCCTGCG GTTCCTGTTC GTGCCCCTCT TCATG CTGTG CCS$E0TGCCC 13 80 

CAGAGGTCCC GGCTGCCCAT CCTCTTCCCA CAGGATGCCT ACTTCATCAC CTTCATGCTG 1440 
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CTCTTTGCCG TTT CTAATGG CTACCTGGTG TCCCTCACCA TGTGCCTGGC GCCCAGGCAG 

GTGCTGCCAC ACGAGAGGGA GGTGGCCGGC GCCCTCATGA CCTTCTTCCT GGCCCTGGGA 

CTTTCCTGTG GAGCCTCCCT CTCCTTCCTC TTCAAGGCGC TGCTCTGAAG TGGCCCCTCC 

AGGCTCTTTG GCAGCCTCTT CTCGACGTCT CCTTCCGGAG CTGAGATCCA GCCCAGGGCG 

AATGGCG AG C TTGGCTCAGG CCTCTGCGGG GTGGAGGCCC CTGGGCCTGA GGCTGCCAGC 

AGCGGGCAGG AGCTGCTCTT CATCCACTTG GAGTGCTGCG GGGAAGAAAT CACCACCGGT 

CATTCTAACC CTCACCCAGG AATGGGGGTG ACTCGCACAA GACCTCATGG AAAGGGTGAT 

GACTAGGGAA AAGAGGGTGC AGGGCACGGC TGCTCCCCAC CACCAGGTCT GCATTTGTTC 

AT CAT CATC A GGAGCAGAGG TGACCAGAGG GTTCAGAGTG GGAGGCAGGG CCAGCCCAGG 

CCAGGAGCGC CTCATCTTCC CAGGCCTCAG CCACCCAGGG TAAAAGG TG C CAGGGAAGTT 

GTGGGCACCT GAG AGG AG G A AC AG ATGTG G AGGACCTGAG GGTGCTCAAA GGGCCAGGCT 

CAGCCTCAAG CAGTGTTTTC ATTGCCAACA C TTACTGT AC CCACTCCGCA GAGCCCCGCT 

GGGCCTGGGC CCCAGGGCCA CAGCTAGCCT GCATGTGTGT ACTGCACTTT AC AGT TTGCA 
AAGCTCTTCC ATACCCACTC TCTCACCGAA G C CTAATTG A GGCTCTTGGA AGG AG TCAGG 

CAAGGATTGT GCTTCCCCCA TTATACAGGT GACAAAACTG AGTCCTGGGG AAAGTG AC TG 

GTCCGTGGTA GAGCCGGGAC CCAATCCCCT CTCTCTCCTC CCTGTTGGTG CTGTTCTTCC 

TGCCCAACAC CTGTTTCTCT TTTCCTCAAG GGGTTTGGGG CAGGAGCCTG GGCACTTACT 

CCCCGTTTTT GCTGTTTCTC CTTCTGACCC TGCTCTTGGG TCTAATAACC CCATTTATTT 
GT 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 456 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(A) DESCRIPTION: hENT2 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2522 



(A) ORGANISM: Homo sapiens 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala Arg Gly Asp Ala Pro Arg Asp Ser Tyr His Leu Val Gly He 
1 5 .10 15 

Ser Phe Phe He Leu Gly Leu Gly Thr Leu Leu Pro Trp Asn Phe Phe 
20 25 30 

He Thr Ala He Pro Tyr Phe Gin Ala Arg Leu Ala Gly Ala Gly Asn 
35 40 45 

Ser Thr Ala Arg He Leu Ser Thr Asn His Thr Gly Pro Glu Asp Ala 
50 55 60 

Phe Asn Phe Asn Asn Trp Val Thr Leu Leu Ser Gin Leu Pro Leu Leu 
65 70 75 80 

Leu Phe Thr Leu Leu Asn Ser Phe Leu Tyr Gin Cys Val Pro Glu Thr 
85 90 95 

Val Arg He Leu Gly Ser Leu Leu Ala He Leu Leu Leu Phe Ala Leu 
100 105 110 

Thr Ala Ala Leu Val Lys Val Asp Met Ser Pro Gly Pro Phe Phe Ser 
115 120 125 

He Thr Met Ala Ser Val Cys Phe He Asn Ser Phe Ser Ala Val Leu 
130 135 140 

Gin Gly Ser Leu Phe Gly Gin Leu Gly Thr Met Pro Ser Thr Tyr Ser 
145 150 155 160 

Thr Leu Phe Leu Ser Gly Gin Gly Leu Ala Gly He Phe Ala Ala Leu 
165 170 175 

Ala Met Leu Leu Ser Met Ala Ser Gly Val Asp Ala Glu Thr Ser Ala 
180 185 190 

Leu Gly Tyr Phe He Thr Pro Cys Val Gly He Leu Met Ser He Val 
195 200 205 

Cys Tyr Leu Ser Leu Pro His Leu Lys Phe Ala Arg Tyr Tyr Leu Ala 
210 215 220 

Asn Lys Ser Ser Gin Ala Gin Ala Gin Glu Leu Glu Thr Lys Ala Glu 
225 230 235 240 

Leu Leu Gin Ser Asp Glu Asn Gly He Pro Ser Ser Pro Gin Lys Val 
245 250 255 

Ala Leu Thr Leu Asp Leu Asp Leu Glu Lys Glu Pro Glu Ser Glu Pro 
260 265 270 



Asp 



Glu 



Pro Gin Lys Pro Gly Lys Pro Ser Val Phe Thr Val Phe Gin 
275 280 285 
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Lys lie Trp Leu Thr Ala Leu Cys Leu Val Leu Val Phe Thr Val Thr 
290 295 300 

Leu Ser Val Phe Pro Ala lie Thr Ala Met Val Thr Ser Ser Thr Ser 
305 310 315 320 

Pro Gly Lys Trp Ser Gin Phe Phe Asn Pro lie Cys Cys Phe Leu Leu 
325 330 335 

Phe Asn lie Met Asp Trp Leu Gly Arg Ser Leu Thr Ser Tyr Phe Leu 
340 345 350 

Trp Pro Asp Glu Asp Ser Arg Leu Leu Pro Leu Leu Val Cys Leu Arg 
355 360 365 

Phe Leu Phe Val Pro Leu Phe Met Leu Cys His Val Pro Gin Arg Ser 
370 375 380 

Arg Leu Pro lie Leu Phe Pro Gin Asp Ala Tyr Phe lie Thr Phe Met 
385 390 395 400 

Leu Leu Phe Ala Val Ser Asn Gly Tyr Leu Val Ser Leu Thr Met Cys 
405 410 415 

Leu Ala Pro Arg Gin Val Leu Pro His Glu Arg Glu Val Ala Gly Ala 
420 425 430 

Leu Met Thr Phe Phe Leu Ala Leu Gly Leu Ser Cys Gly Ala Ser Leu 
435 440 445 

Ser Phe Leu Phe Lys Ala Leu Leu 
450 455 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(A) DESCRIPTION: hHNP3 6 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Ala Ser Val Cys Phe lie Asn Ser Phe Ser Ala Val Leu Gin Gly 
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1 5 10 15 

Ser Leu Phe Gly Gin Leu Gly Thr Met Pro Ser Thr Tyr Ser Thr Leu 
20 25 30 

Phe Leu Ser Gly Gin Gly Leu Ala Gly lie Phe Ala Ala Leu Ala Met 
35 40 45 

Leu Leu Ser Met Ala Ser Gly Val Asp Ala Glu Thr Ser Ala Leu Gly 
50 55 60 

Tyr Phe lie Thr Pro Tyr Val Gly lie Leu Met Ser He Val Cys Tyr 
65 70 75 80 

Leu Ser Leu Pro His Leu Lys Phe Ala Arg Tyr Tyr Leu Ala Asn Lys 
85 90 95 

Ser Ser Gin Ala Gin Ala Gin Glu Leu Glu Thr Lys Ala Glu Leu Leu 
100 105 110 

Gin Ser Asp Glu Asn Gly He Pro Ser Ser Pro Gin Lys Val Ala Leu 
115 120 125 

Thr Leu Asp Leu Asp Leu Glu Lys Glu Pro Glu Ser Glu Pro Asp Glu 
130 135 140 

Pro Gin Lys Pro Gly Lys Pro Ser Val Phe Thr Val Phe Gin Lys He 
145 150 155 160 

Trp Leu Thr Ala Leu Cys Leu Val Leu Val Phe Thr Val Thr Leu Ser 
165 170 175 

Val Phe Pro Ala He Thr Ala Met Val Thr Ser Ser Thr Ser Pro Gly 
180 185 190 

Lys Trp Ser Gin Phe Phe Asn Pro He Cys Cys Phe Leu Leu Phe Asn 
195 200 205 

He Met Asp Trp Leu Gly Arg Ser Leu Thr Ser Tyr Phe Leu Trp Pro 
210 215 220 

Asp Glu Asp Ser Arg Leu Leu Pro Leu Leu Val Cys Leu Arg Phe Leu 
225 230 235 240 

Phe Val Pro Leu Phe Met Leu Cys His Val Pro Gin Arg Ser Arg Leu 
245 250 255 

Pro He Leu Phe Pro Gin Asp Ala Tyr Phe He Thr Phe Met Leu Leu 
260 265 270 

Phe Ala Val Ser Asn Gly Tyr Leu Val Ser Leu Thr Met Cys Leu Ala 
275 280 285 

Pro Arg Gin Val Leu Pro His Glu Arg Glu Val Ala Gly Ala Leu Met 
290 295 300 
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Thr Phe Phe Leu Ala Leu Gly Leu Ser Cys Gly Ala Ser Leu Ser Phe 
305 310 315 320 

Leu Phe Lys Ala Leu Leu 
325 

( 2 ) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS :• single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(A) DESCRIPTION: hENTl 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Thr Thr Ser 
1 

He Phe Phe Met 

20 

Met Thr Ala Thr 
35 

Val Ser Leu Val 
50 

Ala Pro Ala Ala 
65 

Asn Asn Val Met 



Tyr Leu Asn Ser 
100 

Leu Gly Ser Leu 
115 

Leu Val Lys Val 
130 

He Lys lie Val 



His Gin Pro Gin 
5 

Leu Gly Leu Gly 



Gin Tyr Phe Thr 
40 

Thr Ala Glu Leu 
55 

Pro Leu Pro Glu 
70 

Thr Leu Cys Ala 
85 

Phe Leu His Gin 



Val Ala He Leu 
120 

Gin Leu Asp Ala 
135 

Leu He Asn Ser 



Asp Arg Tyr Lys 
10 

Thr Leu Leu Pro 
25 

Asn TLrg Leu Asp 



Ser Lys Asp Ala 
60 

Arg Asn Ser Leu 
75 

Met Leu Pro Leu 
90 

Arg He Pro Gin 
105 

Leu Val Phe Leu 



Leu Pro Phe Phe 
140 

Phe Gly Ala He 



Ala Val Trp Leu 
15 

Trp Asn Phe Phe 
30 

Met Ser Gin Asn 
45 

Gin Ala Ser Ala 



Ser Ala He Phe 
80 

Leu Leu Phe Thr 
95 

Ser Val Arg He 
110 

He Thr Ala He 
125 

Val He Thr Met 



Leu Gin Gly Ser 
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Leu Phe Gly Leu Ala 
165 

Met Ser Gly Gin Gly 
180 

Cys Ala lie Ala Ser 
195 

Phe lie Thr Ala Cys 
210 

Gly Leu Pro Arg Leu 
225 



75 

150 155 

Gly Leu Leu Pro Ala Ser 
170 

Leu Ala Gly Phe Phe Ala 
185 

Gly Ser Glu Leu Ser Glu 
200 

Ala Val lie He Leu Thr 
215 

Glu Phe Tyr Arg Tyr Tyr 
230 235 
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160 

Tyr Thr Ala Pro He 
175 

Ser Val Ala Met lie 
190 

Ser Ala Phe Gly Tyr 
205 

lie lie Cys Tyr Leu 
220 

Gin Gin Leu Lys Leu 
240 



Glu Gly Pro Gly Glu Gin Glu Thr 
245 

Glu Glu Pro Arg Ala Gly Lys Glu 
260 

Ser Gin Pro Thr Asn Glu Ser His 
275 280 

He Ser Val Leu Ala Phe Ser Val 
290 295 



Lys Leu Asp Leu He Ser Lys Gly 
250 255 

Glu Ser Gly Val Ser Val Ser Asn 
265 270 

Ser He Lys Ala He Leu Lys Asn 
285 

Cys Phe He Phe Thr He Thr He 
300 



Gly Met Phe Pro Ala Val Thr Val Glu Val Lys Ser Ser He Ala Gly 

305 310 315 320 

Ser Ser Thr Trp Glu Arg Tyr Phe He Pro Val Ser Cys Phe Leu Thr 

325 330 335 



Phe Asn He Phe Asp Trp Leu Gly 
340 

Trp Pro Gly Lys Asp Ser Arg Trp 

355 360 

Leu Val Phe Val Pro Leu Leu Leu 

370 375 



Arg Ser Leu Thr Ala Val Phe Met 
345 350 

Leu Pro Ser Leu Val Leu Ala Arg 
365 

Leu Cys Asn lie Lys Pro Arg Arg 
380 



Tyr Leu Thr Val 
385 

Ala Ala Phe Ala 



Phe Gly Pro Lys 
420 

He Met Ala Phe 
435 



Val Phe Glu His 
390 

Phe Ser Asn Gly 
405 

Lys Val Lys Pro 



Phe Leu Cys Leu 
440 



Asp Ala Trp Phe 
395 

Tyr Leu Ala Ser 
410 

Ala Glu Ala Glu 
425 

Gly Leu Ala Leu 



He Phe Phe Met 
400 

Leu Cys Met Cys 
415 

Thr Ala Gly Ala 
430 

Gly Ala Val Phe 
445 
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Ser Phe Leu Phe Arg Ala lie Val 
450 455 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
GGATCCGCGG TGGCGCGACC CTCTGTCCCC GCCTCGGGGC GGAGCCCAGG TCCCAGCCTG 
CGGAGCGCGA GACACGC CG A AATCCGCCCG AGGCTACCTG TGCGACTCCA GCCGCCCTGC 
ACCGGAATCT GGGGAGACCC GCCCCCCGCC CCACCGGTCT GCGGCCCTCC GCCCCAGCGC 
AGGTGCAGGT GCGGCTTCTC TGCCCCTTTC ACCCCAGGCG CATCCGCCGC GGCGGCCATG 
GCGCGAGGAG ACGCCCCGCG GGACAGGTGA GTGGGCCCGG GTGCGTTGCA AGTGGCCGGG 
GGCGTTGCAG ACCCGCTCCC TGAAGGCGCT CCGAGGCTCA GAGAAGACCG GATCGAACTA 
CAATTCCCAT CAGCCGACTC CCTCGGCCGC CGGAACCGGG GTGATGGGGG TTGTAGTCCG 
CTCCGGAGGG GGTGGCCTGG GAGGCGG GAG GGCCTGCGGA GGCGGGTGCG TCCTCGGGGT 
GACCTTCCCA CCGATCCCCA CAGCTACCAC CTGGTCGGGA TCAGCTTCTT CATCCTGGGG 
CTGGGCACCC TCCTTCCCTG GAACTTCTTC ATCACCGCCA TCCCGGTGAG ACTCCTGGCG 
GCGTGGCAGC CTCGTGGCCA CAGCCAGCAC CCCTCCCTCC AGCCCTTTGG ATGAAGCTTA 
GCGGGCGCTT CCTCACCGCG CACCTGTGAC CCCTCGTTGA GCTCATTATG GGCTGAAGCT 
CGGAGAGGGG AATGCTTCCC CCGTGGCTAG GAAAGTAGAA CTTCAGCAAT GCCCACATCT 
CAGAGGGGCA AGGCCACCAG CCCACAGGGT C TGGAATG AG CAAAGGCGCT GCCACCCCTC 
TCTGTGTGTC GTTATTCCTG AGTCAGTCAC CCCAAAAGTC GGTTATCGAA CGTTTGATTT 
TCTTTGAAAT AC C ATGAATT TCATTCATTC ACT C ATTC AT TCAACAAACT TTTTTTTTTT 
TTTTTTTTTT GAG AC AGAG T CTGGCTCTTT CGCCCAGGCT GGAGTGCAGT GGCGCCATCT 
CGGCTCACTG CAAGCTCCGC CTCCCGGGTT CACGCCATTC TCCTGCCTCA CCCTCCCGAG 
TAG CTGGG AC TACAGGCGCC CGCCGCCACG CGCGGCTATT TTTTTTTGTA TTTTTAGTAG 
AGACGGGGTT TCACCGTGTT AGCCAGGATG GTCTCGATCT CCTGACCTCG TGATCCTCCC 
GCCTCGGCCT CCCAAAGTGC TGGGATTACA GGCGTGAGCC ACCGTGCCCA GCCTCATTCA 
ACAAACTTTT AGTGTGCATC TAC TGTGGAG CAGGCACTGG GGACACAGGA GGAAACAGCA 
GGGAGGCTCT TCAGGGAAGG CAGAAATGTG GGGTTTG CAT TGTCTTTGGG AC CGGGTTAT 
TC AT CTGTAT TCACTGCAAC AACTTTGCAA ATGCTTCTTG GGTACTGGCT CTGTG CTGGG 
CCCTGGAAAC CCAGAGATGA ATCAGCCCCT GGG CTTGAG A GCAAGAAGGG GCCAAAGAGC 
TATTAATAAT GTAACATGAT GCGTGACATT CCAGGCTTGC AGCAGAGTGC AGTGGGTCCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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CAGGGAAGGA GAAAGTTCCT TCTGCTTCAT GGAAGAGGAG ATTTGTAAAT TGGGAGTAGG 162 0 

GTAGGCAAAG TGCGTGTGGA GGGGTGTGGT CAGTAGGGCA TTCCAAGCCG AGGCGACAGC 168 0 

CATGCCAAAG GCAGGCAGGC AAGAGACGAT CAGCCTGTTT AGAGGGAGAT TCCACAGCCA 174 0 

GGGCTGCCTG GAG CTTAG C A GGATGGAGCA GAAGATGGGG CACAAAGGGA GACTAGGATC 18 00 

TGATTCTGAA GAGCTGTTCC ATTTGGGGCT TTGCCCTGCA GGCAATAGGG AGGCATGAAT 1860 

CGGGGTGTTG AGGAGTGAGG AGGTTAAGCA GAGGAGTGGC AGGCTATGTG CTCTAGAGAG 192 0 

AATG CAGTTG TTCAGCACTT AGGCCAAAGC CTGGCTTGAC AG TAG G CG CT CAATAAATAC 198 0 

CCGTGGAATG AATGAATGTA GCAGCTGCTG CAGGAGTGGG GATGGGGGCT GGAACCAGGG 2 04 0 

CACTGAAGAG GAGGGGCCGT CCAAGGCTGG ATCGAGGCTC TTGCTGGGGG CTCTTAATGC 2100 

TTAGGCTGTG TCCCAGACTT CAGCCATTTA ACTCAGCACA CATTCATGGA GCTCCGACTG 2160 

TGTGCCGGCT GCTGGGAACA GAACAAGGGC AAGGCAGACA AGTCCCCACA GACATTAGAA 2220 

AGCAATCACA AG TGAGGGGG AAGGCAGCGG GGGAAGGCTG AGAGGTGCTG ACCCTCCACC 228 0 

ACCTCCCTAC CTGG CAGTAC TTCCAGGCGC GACTGGCCGG GGCCGGCAAC AGCACAGCCA 2 34 0 

G GATCCTG AG CACCAACCAC ACGGGTCCCG AG GATG CCTT CAACTTCAAC AATTGGGTGA 2400 

CGCTGCTGTC CCAGCTGCCC CTGCTGCTCT TCACCCTCCT CAACTCCTTC CTGTACCAGT 24 6 0 

GGTGAGAGGC CTGCCCTGGC TCCTGCGCCC TCTGCCGAGG CAGCTTCATT GAGGCCCTCC 2 52 0 

CCTGCGCCCC CTGCCCTCCA GCCCTACTGC CCAGCCCCAG GTGTCGAGCC TCCTTCCCCA 258 0 

GCCCCCTCTG GCCTGGGCCC CACTGATGCA CTCTGCCTGC TTCTGAGCAA GCGTCCCGGA 2 64 0 

GACGGTGCGC ATTCTGGGCA GCCTGCTGGC CATACTGCTG CTCTTTGCCC TG AC AG C AG C 270 0 

GCTGGTCAAG GTGGACATGA GCCCCGGACC CTTCTTCTCC ATCACCATGG CCTCCGTCTG 2 76 0 

CTTCATCAAC TGTGAGCACC TCCACCCCCT CTCCAGCCAG CCTATGCAGG GCTTCAGCCT 2 82 0 

GGCCTCATCA TTGAAAGGGC CCAGCATATC CGAGAAGGGC AGACAGCATC ATGGTCGCTC 28B0 

ATATCCCTGG TGAAGAAACT GAGGCCCACA GGGAGGGGAA GAGTCACTTG TCCGGTGACC 2 94 0 

TAGGAGCAGG CCTCCTGGTC AACAGCCCCA C AG AC C AATG GCTGCACCTC AGAAGAGGAC 300 0 

TGAATAGCGG GTGTTGCCCC CGAGTGCTCA GAGTCCCTAG GGAAGCTCAC ACCTGCGCAA 306 0 

CCTTGTCCAG AGTCCCCTGT GTATCCTGCC GGCACCTCCT CCAGGGAGCC TCTGAGTCTT 312 0 

GCCTAGTTGA GCAGCAGCCC CCATCCCTGT CCTCCACAGC CTTCAGTGCA GTCCTACAGG 318 0 

GCAGCCTCTT CGGGCAGCTG GGCACCATGC CCTCCACCTA CAGCACCCTC TTC CTCAGCG 324 0 
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GCCAGGGCCT GGCTGGGATC TTTGCTGCCC TTGCCATGCT CCTGTCCATG GCCAGTGAGT 3 300 

GCACTTGGGT GGCTGGGAGG GCTGGGGTGG CCTCTGAGGT TTGGGGAAGA GAGAGGGCAT 3 360 

GTGAGAGCAA GACACATGGG TTCTGGGTGA AGATGGAGGT AAGCGGGTGA TATGGAAATG 342 0 

GGGATTGGTC TGGGG CTAGG GAATGGGGCT CATGGGCCCT GCAGTGAGGA GTAATAACCA 34 80 

AGTGAGGACT GGGTTAACTC AGGGACAGGG GCAGGATTCC TGGGGCTAAT ACTGGCATGT 3 54 0 

GGCAGCAGGT TGAAGTTGAA GGATAAGGGG ATGGGTTTGG GATTCAGATA GTCTTGGGTT 3600 

TGAATCTGCT TCACCGCTTA CCAGCTAGGG GTGTTGGACA AGG CTTGTC A CCTCTCTGTG 3660 

CTAGCTTCCC CACCGATGTG ATTGGTACAG CTCCCTGCTC AGATTGTAAT GAGCATGCAA 3 720 

TGAGAGAAGG CTACTGGCAC ATAGTACGTG CTCAACAAAA ATGACACATG GGGAAAGTGA 3 780 

GAGAAGTGCA GGGCTGCTCT GGGGCCCTGT ACAAGATTCC CATTTGTCAG TGAAGGGAGG 3840 

AG CGGAAGAG GCTGGGAGTG GGTCTGAGAA GTACACAATG GGAAGTGGGA CAAGAGTTGG 3900 

AAGCCCCGTG GGAGCCGGCG GGACCAGGTG CCTCTCTTCT GCAGCTGAAG TTCCTCCGCA 396 0 

GGTGGCGTGG ACGCCGAGAC CTCTGCCCTG GGGTACTTTA TCACGCCCTG TGTGGGCATC 402 0 

CTCATGTCCA TCGTGTGTTA CCTGAGCCTG CCTCACCTGG TGAGCCTGCT GTTGGGCTCG 4080 

AGGCCCCACC TCAAAGCATC TTGGATAGAG TCCTGAGCCT GAAGCCCTGA GAGAGGCCAG 414 0 

GGGAGG TGG A GGAGACCTGG TCTCAGCCCT GACCCCCAGA GAAGACACTG AGGGGCCCCA 4200 

GCCTCCAGGC CAATGGTATG GGGAGG GAT C CAGACACCTC AGGCAAGCCA GGCAGGCCCA 426 0 

ACACTTTCCT GTCCTTCTGC AGAAGTTTGC CCGCTACTAC CTGGCCAATA AATCATCCCA 43 2 0 

GGCCCAAGCT CAGGAGCTGG AGACCAAAGC TGAGCTCCTC CAGTCTGGTA AG C C CTGAG A 43 8 0 

CCCTCCTGGG GAGGTGGGAG ATGCAGAGGA AG CTAG AG C C ACCTCCCCTG GGAAG CTGTT 444 0 

CCATCTGTTC CCAGCCAGAG CCCACCCCTA GTAGCCTTGT GCAAACAGGA AGATCATGAA 4 500 

GGGAAGTTGG TAGGATTAAA GTCATCCCTG CTGTTGTTTG GGCCTCAGTT TCCACCTCTA 4 56 0 

TAAAATGGGG AGGCGACAGA AGTTCCATGC ATGCAAACTT TGGATCGAAG ACCTCTGAAT 4 620 

TGGAATACTA GTTTCACAAC ATCCCRGCTG TGTGGCCTGA GACAAACCAC TTAGCCACTG 46 8 0 

CACCCCTCTG AACCTCAATG TGTCATTTGT AAAGCAATGG TAATGAGATA ATCCATCTAA 4 74 0 

GGTGCTTCGC TCATCACCCG ACCCATGCAC GCGCTTCTGG TAGCTATGCA TATTTCCATC 48 00 

ATGAATTCCC TTCGCCTGCA GCCTCAGCTT AGGCTGGAGG AAGATCACCT TTTTTTGTTT 48 60 

TGGGGTGAGG GGGTTGTTGT TATTTTGAGT CAGGATCTCA CTCTGTCACC TAGGCTGCAC 4 920 
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TGCAGTGCTA TCACAACTCA ACTGCAGCTT CGACCTTCTG GGCTCAAGTG AGCCACCTCA 4 98 0 

GTCTCCCGAG TAG C TGGGAC TATAGGTGCA GGCTGCCATG CCCGGCTAAT TTTTTTATTT 504 0 

TTGTAGAGAT GGTGATTCAC CATGTTGCCC AGGCTGGTCT CGAACTCCTG GGCTCAAGCA 510 0 

ATATGCCCGC TTCGGCCTTC CAAAATGTTG GGATTACAGG CGTGAGCCAC CATGCCGAGC 516 0 

TGAGGATCAC TTGTTTTAAC TGCTGGGAAT CTCCCTTCGT TGGGCCTGGC TGTCGGGAAA 5220 

CCTGGGTCAC AAGCATGACC CTTCCCCGTC CCCCCTCACC C C AGATG AG A ACGGGATTCC 52 BO 

CAGTAGTCCC CAGAAAGTAG CTCTGACCCT GGATCTTGAC CTGGAGAAGG AGCCGGAATC 5340 

AGAGCCAGAT GAGCCCCAGA AGCCAGGAAA ACCTTCAGTC TTCACTGTCT TCCAGAAGGT 54 0 0 

TTGGCTTGGA TACAGCCCCC AACCACCATC TTTGGGGAAG AATGGGGCTC ACATTGACTC 54 6 0 

CAAGGTCATA GGGTCACAGT GGGTCAGGGA CACAGCTGGG CCAGGCCCCA AGTGTCCTGC 5520 

TCCGACATGG GGCTTGGGCA AGAGGGTGGG GCCCTGGGAC TGCCCTGCCT GCTCACACCC 558 0 

CTGCCTCYGG CTCCCAGATC TGGCTGACAG CGCTGTGCCT TGTGTTGGTC TTCACAGTCA 564 0 

CCCTGTCCGT CTTCCCCGCC ATCACAG CCA TGGTGACCAG CTCCACCAGT CCTGGGAAGT 57 00 

GGAGTGAGTG TCAGGGTGGA GAAGACGGCA GGG CAGGGGG TACAAAGGGG AGAGGACGGG 5760 

AGAGGGGAGT TGGAGACCAG TATGAGCTGC AGCCGTTTCC CTCCCAGGTC AGTTCTTCAA 582 0 

CCCCATCTGC TGCTTCCTCC TCTTCAACAT CATGGACTGG CTGGGACGGA GCCTGACCTC 58 80 

TTACTTCCTG TGGGTAAGCA CACCAGGGCT GGGTGATCCG ATGTTTTAGG AAGCAGTTTG 594 0 

GGATC CGAGG GCTTGAAAGA GCACGGAGGT GATTTTCTGG TAGTCCAAGT GGCCTGGTAA 6 000 

TGCAACCACT GGCCAAGCAG CAGGGAGCAC TTGGGCCCTG GAGGCGTGCA AGGCCAGGGC 6060 

TTGCACTGTG AGCTCCCTGA AAGCAAAAAT CATGTCCAGC TGACCTCTGT GTCCCCAGCA 6120 

TCCAGCCTTT GCTGCTCAGA GAATGTTACA TGGAGGTTCC TGCACCAGGT GAGGGACTGA 6180 

GCAAGATCTT AGTTTTGGGG TTGGTTTTAG CCATGGTGCT GTATCTTTAA ATGAAATCTT 624 0 

CCAAAGAGAC AATACATAAC GCAGGTGAAA GANGANCTGG TCTCATCCAA GTCAGGACAN 63 00 

NGAGCTGATC TACAGCTTCC AATCCCACTC AGAAAC CCTC TGCCCCCAAN GGGG 6354 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

CATAGGGACC TAGTAAAACA AATTAAAGTG GTAGTTATTT GGATCAGTGA TAACTAATTC 60 

TTATAATTTT TGCAGATGGA ATTGGCTG CT CTAGAAAAAA TTAAATCTAC TTGGATTAAA 12 0 

AACCAAGATG ACAGCTTGAC TGAAACAGAC ACTCTG GTAT GTATGGGTCA GTTTCCTGTT 180 

TCAGCTGTTT CAAATAGTGT TTGTCCCTTT AGAAATAACG GCAGAAGGAC CCTCAGGACC 24 0 

ACCATAGAAA TTTCACCTAA ATCTGCAGGC TTATGAATGT CCTGCACTCT CNTTCTCCTG 300 

AAATC CTTAC CCCGTGGAAT GCAACCTACT ACCTGGGTGT AG AC AC C AAG TTGCTCTCAA 36 0 

ACTTAGTATA CCAGAAATGT CCTCATTCTG CCCTTTAATA AGAGCTGACC AAATGCTAGC 42 0 

TGGGGAAACT TCTCACCATC TGTCACCAGC GTTCTCCCTG GAAAACATCC CTTCCTCAAT 48 0 

TGGGATGTTG CTGTTTCCCG TGGGCCAAGG CAACCCACAA CATTCAAGTC TCGACTGGTA 54 0 

CTGGTCCGTG TTC TTAAGAG GTGCTGGAGC TGCCCAAGGA GTGCAGGCTT AARCCCCAGT 6 00 

GAAGTGGAAT TGAGTTGGTT GGGATGCCCA GTTTTTTTAC AGGGTCGAAT TGCACAAACA 66 0 

TTTACTGTGC CCCTGCTTGT GTTGGG C AC T GAAGATGCAA ACATGAGTGA GCCACAGTTT 72 0 

GCATCCCCTG TACCTCCGGC CCCAGGGAGG TACAGGG GAT GCAAACTGGA GAAGCGACAT 78 0 

CTG AG CTGGG CTTTGCGGGT TACG TCAAAG TTCATCCCGT GGGACCAGAG GCAGGACCCT 84 0 

TGTGGGGAAG GAGCATAGAA CTTTACAGAA TGCCATCTAG GGATG AG CCT CACGGTGGGA 90 0 

CCTGCTGGGA GTTGACTAGA ATCTGTGAAA GAATTATTTT AAGCCTTATG TTTTCTATAG 960 

TAAATAAGAC TACATTTAAA GATCTTATGT ATTTAGGCTT G ATT C AAG AT TAATTTGAAA 1020 

CTCACTACCC TAACTTACAT TTTCTAGTTC ACC AG GTAAT CTGAATAATC CTACTTCCAC 1080 

CGTGGCCCCA CTGTAGTCCG TACTG CACGT GGCAAGTACA GTGTGSCCTT TTCAAAATTA 114 0 

AATTCCAATT GTGTCACTTC CTGATTAAAA CTCTTCAGTG ATTGGCCAAA TCTCAGCAAT 1200 

TTAATGTTGA GTGAGTAAAA AGAAGC CGGA ATGCCAAAAA ATGCACGCCA TAGGATTCCA 126 0 

GTTCTGTGAA ACTCACAAAC AGGCAAAACT AATCCATGAG GGTGACGTCA GCATACCTTG 1320 

TTACCCCAGG GGGAGAGGGA GGGGCATCGG GAGGC CTCAG GAATGCTGGA ACGTTCTGTC 13 8 0 

TTGATCTGGT TAATGGTCAC CTGGGGGCAT ATTTGCATAA AAATTCAAGT TGACTATTNT 144 0 



WO 98/46749 PCT/US98/07283 

81 

AGATTTGTGC TTCTTACTTT ATAGAAGTTA TTCCCTCAGT AAACATTTTG AAAACATAAA 1500 

G AC C AG GCAG AGGCAGGGAA GTAGG CAGGT GTGCGGCCTG T ATTGGTAG C AGAGTCCTCC 1560 

CTGAGGGCTG GATCATTAGG GAGGTAGTGG GCCCAGGGAG GAGGCACGGG AGGTTAATTT 1620 

AGAAAGGTGG CCCAGGCTGG GTCATGGTGG CCTCAGAGGC CCCACTAAAG AATCAGACTT 168 0 

GGCCAGGTGT GGTGGCTCAC ACC TGTAATC CCAGTACTTT TGGGAGGCTG AGGCAGGCAC 174 0 

ATTGTTTGAG CCTGGGGATT CAAGACCAGC CCGGGCAACA TGGGAAAAAC CCSGTCTCTA 180 0 

CAAAAAAAAA AAAAAAAAAA AATAGTAATA CAAAAAAATT AG C CAGGCAT GCTGACATGC 1860 

ACCTGTACTT GGAAGGCTGA GGTAAGATAA CCACTTG AG C CCAGGAGTTC AAGGCTG CAG 1920 

TGAGCTGAGA CCATGCCACT GCACTNCAGG CTGTGCAAGA GAGCAAGACC CTGTCTAAAA 1980 

AAAATTTAAA AGGATGTCAG GAATTAGGCT GGGGGCGGTG GTTCATGCCT GTAATC C CAG 2 04 0 

CACTTTGGGA GGCCGAGGCG GGCGGGTTCC GCAGGCCCCT AGGTGGGGCG GGGCCCGAGC 2100 

CAGAGTCGAG TCCCTGAGGC GGGTGGGGGA AGGAGAGACG 2140 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CTAGAATTCA GCGGCCGCTG AATTCTAGGG CATCCTCATG TCCATCGTGT GTTACCTGAG 6 0 

CCTGCCTCAC CTGAAGTTTG CCCGCTACTA CCTGGCCAAT AAATCATCCC AGGCCCAAGC 12 0 

TCAGGAGCTG GAGACCAAAG CTGAGCTCCT CCAGTCTGGT AAGCCCTGAG ACCCTCCTGG 180 

GGAGGTGGGA GATGCAGAGG AAGCTAG AG C CACCTCCCCT GGGAAGCTGT TCCATCTGTT 24 0 

CCCAGCCAGA GCCCACCCCT AGTAGCCTTG TGCAAACAGG AAGATCATGA AGGGAAGTTG 3 00 

GTAGGATTAA AGTCATCCCT GCTGTTGTTT GGGCCTCAGT TTCCACCTCT ATAAAATGGG 36 0 

GAGGCGACAG AAGTTCCATG CATGCAAACT TTGGATCGAA GACCTCTGAA TTGGAATACT 4 20 

AGTTTCACAA CATCCCRGCT GTGTGGCCTG AGACAAACCA CTTAGCCACT GCACCCCTCT 4 80 
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GAACCTCAAT GTGTCATTTG TAAAGCAATG GTAATGAGAT AATCGATCTA AGGTGCTTCG 540 

. CTCATCACCC GACCCATGCA CGCGCTTCTG G TAG CTATGC ATATTTCCAT CATGAATTCC 6 00 

CTTCGCCTGC AGCCTCAGCT TAGGCTGGAG GAAGATCACC TTTTTTTGTT TTGGGGTGAG 6 60 

GGGGTTGTTG TTATTTTGAG TCAGGATCTC ACTCTGTCAC CTAGGCTGCA CTGCAGTGCT 72 0 

ATCACAACTC AACTGCAGCT TCGACCTTCT GGGCTCAAGT GAGCCACCTC AGTCTCCCGA 78 0 

GTAGCTGGGA CTATAGGTGC AGGCTGCCAT GCCCGGCTAA TTTTTTTATT TTTGTAGAGA 84 0 

TGGTGATTCA CCATGTTGCC CAGGCTGGTC TCGAACTCCT GGGCTCAAGC AATATGCCCG 90 0 

CTTCGGCCTT CCAAAATGTT GGGATTACAG GCGTGAGCCA CCATGCCGAG CTGAGGATCA 960 

CTTGTTTTAA CTGCTGGGAA TCTCCCTTCG TTGGGCCTGG CTGTCGGGAA ACCTGGGTCA 1020 

CAAGCATGAC CCTTCCCCGT CCCCCCTCAC CCCAGATGAG AACGGGATTC CCAGTAGTCC 1080 

CGAGAAAGTA GCTCTGACCC TGGATCTTGA CCTGGAGAAG GAGCCGGAAT CAGAGCCAGA 114 0 

TGAGCCCCAG AAGCCAGGAA AACCTTCAGT CTTCACTGTC TTCCAGAAGA TCTGGCTGAC 12 0 0 

AGCGCTGTGC CTTGTGTTGG TCTTCAC AG T CACCCTGTCC GTCTTCCCCG CCATCACAGC 126 0 

C ATGGTG AC C AG 12 72 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 247 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Ala Arg Gly Asp Ala Pro Arg Asp Ser Tyr His Leu Val Gly lie 
15 10 15 

Ser Phe Phe lie Leu Gly Leu Gly Thr Leu Leu Pro Trp Asn Phe Phe 
20 25 30 

lie Thr Ala lie Pro Tyr Phe Gin Ala Arg Leu Ala Gly Ala Gly Asn 
35 40 45 



Ser 



Thr Ala Arg lie Leu Ser Thr Asn His Thr Gly Pro Glu Asp Ala 
50 55 60 
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Phe Asn Phe Asn Asn Trp Val Thr Leu Leu Ser Gin Leu Pro Leu Leu 
65 70 75 80 

Leu Phe Thr Leu Leu Asn Ser Phe Leu Tyr Gin Cys Val Pro Glu Thr 
85 .90 95 

Val Arg lie Leu Gly Ser Leu Leu Ala He Leu Leu Leu Phe Ala Leu 
100 105 HO 

Thr Ala Ala Leu Val Lys Val Asp Met Ser Pro Gly Pro Phe Phe Ser 
115 120 125 

He Thr Met Ala Ser Val Cys Phe He Asn Ser Phe Ser Ala Val Leu 
130 135 140 

Gin Gly Ser Leu Phe Gly Gin Leu Gly Thr Met Pro Ser Thr Tyr Ser 
145 150 155 160 

Thr Leu Phe Leu Ser Gly Gin Gly Leu Ala Gly lie Phe Ala Ala Leu 
165 170 175 

Ala Met Leu Leu Ser Met Ala Ser Gly Val Asp Ala Glu Thr Ser Ala 
1B0 185 190 

Leu Gly Tyr Phe He Thr Pro Cys Val Gly He Leu Met Ser He Val 
195 200 205 

Cys Tyr Leu Ser Leu Pro His Leu Lys Phe Ala Arg Tyr Tyr Leu Ala 
210 215 220 

Asn Lys Ser Ser Gin Ala Gin Ala Gin Glu Leu Glu Thr Lys Ala Glu 
225 230 235 240 

Leu Leu Gin Ser Gly Lys Pro 
245 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
ATGGCGCGAG GAGACGCCCC GCGGGACAGC TACCACCTGG TCGGGATCAG CTTCTTCATC 60 
CTGGGGCTGG GCACCCTCCT TCCCTGGAAC TTCTTCATCA CCGCCATCCC GTACTTCCAG 120 



WO 98/46749 

GCGCGACTGG CCGGGGCCGG 
CCCGAGGATG CCTTCAACTT 
CTCTTCACCC TCCTCAACTC 
GGCAGCCTGC TGGCCATACT 
ATGAGCCCCG GACCCTTCTT 
AGTGCAGTCC TACAGGGCAG 
ACCCTCTTCC TCAGCGGCCA 
TCCATGGCCA GTGGCGTGGA 
GTGGGCATCC TCATGTCCAT 
TACTACCTGG CCAATAAATC 
CTCCTCCAGT CTGGTAAGCC 
AGAGCCACCT CCCCTGGGAA 
CCTTGTGCAA ACAGGAAGAT 
TGTTTGGGCC TCAGTTTCCA 
AAACTTTGGA TCGAAGACCT 
G C CTG AG AC A AACCACTTAG 
CAATGGTAAT GAGATAATCC 
TTCTGGTAGC TATG CAT ATT 
TGGAGGAAGA TCACCTTTTT 
ATCTCACTCT GTCACCTAGG 
CTTCTGGGCT CAAGTG AG CC 
GCCATGCCCG GCTAATTTTT 
TGGTCTCGAA CTCCTGGGCT 
TACAGGCGTG AG CCACCATG 
CTTCGTTGGG CCTGGCTGTC 
CTCACCCCAG ATGAGAACGG 
CTTGACCTGG AG AAGGAG CC 
TCAGTCTTCA CTGTCTTCCA 
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CAACAGCACA GCCAGGATCC TGAGCACCAA CCACACGGGT 180 

CAACAATTGG GTGACGCTGC TGTCCCAGCT GCCCCTGCTG 24 0 

CTTCCTGTAC CAGTGCGTOC CGGAGACGGT GCGCATTCTG 3 00 

GCTGCTCTTT GCCCTGACAG CAGCGCTGGT CAAGG TGG AC 3 60 

CTCCATCACC ATGGCCTCCG TCTGCTTCAT CAACTCCTTC 42 0 

CCTCTTCGGG CAGCTGGGCA CCATGCCCTC CACCTACAGC 4 80 

GGGCCTGGCT GGGATCTTTG CTGCCCTTGC CATGCTCCTG 54 0 

CGCCGAGACC TCTGCCCTGG GGTACTTTAT CACGCCCTGT 6 00 

CGTGTGTTAC CTGAGCCTGC CTCACCTGAA GTTTGCCCGC 66 0 

ATCCCAGGCC CAAGCTCAGG AGCTGGAGAC CAAAGCTGAG 72 0 

CTGAGAC C CT CCTGGGGAGG TGGGAGATGC AGAGGAAGCT 7 80 

GCTGTTCCAT CTGTTCCCAG CCAGAGCCCA CCC CTAGTAG 84 0 

CATGAAGGGA AGTTGG TAGG ATTAAAGTCA TCCCTGCTGT 900 

CCT C TAT AAA ATGGGGAGGC GACAGAAGTT CCATGCATGC 96 0 

CTGAATTGGA ATACTAGTTT CACAACATCC CRGCTGTGTG 102 0 

CCACTGCACC CCTCTGAACC TCAATGTGTC ATTTGTAAAG 108 0 

ATCTAAGGTG CTTCGCTCAT CACCCGACCC ATGCACGCGC 114 0 

TCCATCATGA ATTCCCTTCG CCTGCAGCCT CAGCTTAGGC 1200 

TTGTTTTGGG GTGAGGGGGT TGTTGTTATT TTGAGTCAGG 126 0 

CTG C ACTGCA GTGCTATCAC AACTCAACTG CAGCTTCGAC 1320 

ACCTCAGTCT CCCGAGTAGC TGGGACTATA GGTG CAGGCT 138 0 

TTATTTTTGT AGAGATGGTG AT TCACCATG TTGCCCAGGC 1440 

CAAGCAATAT GCCCGCTTCG GCCTTCCAAA ATGTTGGGAT 1500 

CCGAGCTGAG GATCACTTGT TTTAACTGCT GGGAATCTCC 1560 

GGGAAACCTG GGTCACAAGC ATG ACCCTTC CCCGTCCCCC 162 0 

GATTCCCAGT AGTCCCCAGA AAGTAGCTCT GAC CCTGG AT 1680 

GGAATCAGAG C C AG ATG AG C CCCAGAAGCC AGGAAAACCT 174 0 

GAAGATCTGG CTGACAGCGC TGTGCCTTGT GTTGGTCTTC 1800 




WO 98/46749 PCT/US98/07283 

85 

ACAGTCACCC TGTCCGTCTT CCCCGCCATC ACAGCCATGG TGACCAG 1847 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2396 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GCACTTTTGG GAGGCCAAGG GCAAGAGAAT CGCTTGAGCT CAGGAGTTCA AAACCAGCCT 6 0 

TGGGCAACAC AGTGAGACTT TGTCTGTACA CACACACACA CACACAAATT TTTAATGAAG 12 0 

AAAATAGAGG CCGGGTATGG TGGCTCACGC CTGTAATCCC AGCACTTTGG GAGGCTGAGG 180 

CAG GTGGATC ACTCGAGGTC AGAAGTTCGA GACCAGCCTG GCCAACATGA TGAAACCTGG 24 0 

CTCTACTAAA AATACAAAAA TTATCTGGGC ATGGTGGTGG CGGGCGCCTA TAGTCC CAG C 300 

TACTCAGGAG GCTGAAGCAG GAG GATTTCT TGAACCCAGG AGGTGGAGGT TGCAGTGAGC 360 

TGAGATCAGG GCCACTGCAC TCCAGCCTGG GCAACAGAGC GAGACTCCAT CTAAAAAAAA 420 

AAAAAAAAAG TACTCTATGG GTGTCCTGAG ATGCCCTGGA G CAG AG AC C T GGCTCCAGGG 480 

ACCATGCTGA CTTCAGCCTC TACCACAGCC AGACGAGGAC AGCCGGCTGC TGCCCCTGCT 540 

GGTCTGCCTG CGGTTCCTGT TCGTGCCCCT CTTCATGCTG TGCCACGTGC CCCAGAGGTC 600 

CCGGCTGCCC ATCCTCTTCC C AC AGGATG C CTACTTCATC ACCTTCATGC TGCTCTTTGC 66 0 

CGTTTCTAAT GGCTACCTGG TGTCCCTCAC CATGTG CCTG GCGCCCAGGT CCGGGCAATG 720 

GGTGGGTGGG GGGCTGGATT AGGAGGTGGT TTATCTTNGG GAAGGACCGC TGCAATGGAG 78 0 

GGACGGCCAT CCTGTTCTGG CCAGCCCAAC CTAGCTGTCT GCAGCCTTGC TGGCGCCCCN 84 0 

TACTGGCCAA GCTTAACTGC AGG GG AG AG A ACTGGGTAGG GAGGTACCCG CCCAACCAAG 900 

TAG CCCAGGC ACTGGTTCTG GGCCGCCTCA ATGTGCNTCA GTTTCCCCAT CTGTAAAAAA 96 0 

AAAAATGGGT TGAACTGTCA TCCCTCAGGG CCCATCTAAC TGTAAAATTC TCAGTTGAAG 1020 

GAGAGCTAAG GTTTTG A C C A AAAACAAGGT CATGGGCTAT TTCCTCAAGG GG CAATGG AG 1080 

TGGAGAATCC AGAGAGAATG AAGCTGGCAG GGCAGACAGG CTGAGAGCAC TGTGGAAAGG 114 0 
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GCAGGCTGTG GAATCTGGAA TCCCATCATG TTAGACTCAG AGGCCCTGAG AGACATCCTT 1200 

ATCCAGCAGC CTCATTTACA GACCAGGAAA CTGAGGCCCA GAAAGAAGGG GCCAGTTATG 1260 

GTGACAGAGG GGTTGGGTCA GAGCCCAGAC TGGATGGGCA GAGGGCAGTG GAGCTGGGTC 13 2 0 

CAGATTTAGA CCCAGCATTT TCTAAGAGCT CCTGTTCCCG GGTGTTTTAG GCAGGTGCTG 138 0 

CCACACGAGA GGGAGGTGGC CGGCGCCCTC ATGACCTTCT TCCTGGCCCT GGGACTTTCC 144 0 

TGTGGAGCCT CCCTCTCCTT CCTCTTCAAG GCGCTGCTCT GAAGTGGCCC CTCCAGGCTC 15 00 

TTTGGCAGCC TCTTCTCGAC GTCTCCTTCC GGAGCTGAGA TCCAGCCCAG GGCGAATGGC 15 6 0 

GAGCTTGGCT CAGGCCTCTG C GGGGTGG AG GCCCCTGGGC CTGAGGCTGC CAGCAGCGGG 16 2 0 

CAGGAGCTGC TCTTCATCCA CTTGGAGTGC TGCGGGGAAG AAATC AC C AC CGGTCATTCT 1680 

AACCCTCACC CAGGAATGGG GGTGACTCGC ACAAGACCTC ATGGAAAGGG TGATGACTAG 174 0 

GGAAAAGAGG GTGCAGGGCA CGGCTGCTCC CCACCACCAG GTCTGCATTT GTTCATCATC 18 00 

ATCAGG AG C A GAG GTGACC A GAGGGTTCAG AGTGGGAGGC AGGGCCAGCC CAGG CCAGG A 1860 

GCGCCTCATC TTCCCAGGCC TCAGCCACCC AGGGTAAAAG GTGCCAGGGA AGTTGTGGGC 1920 

AC CTGAGAGG AGGAACAGAT GTGGAGGACC TGAGGGTGCT CAAAGGGCCA GGCTCAGCCT 1980 

CAAGCAGTGT TTTCATTGCC AACACTTACT GTACCCACTC CGCAGAGCCC CGCTGGGCCT 2040 

GGGCCCCAGG GCCACAGCTA GCCTGCATGT GTGTACTGCA CTTTACAGTT TGCAAAGCTC 2100 

TTCCATACCC ACTCTCTCAC CGAAGCCTAA TTGAGGCTCT TGGAAGGAGT CAGG C AAGGA 2160 

TTGTGCTTCC CC C ATTATAC AGGTGACAAA ACTGAGTCCT GGGGAAAGTG ACTGGTCCGT 2220 

GGTAGAGCCG GGACCCAATC CCCTCTCTCT CCTCCCTGTT GGTGCTGTTC TTCCTGCCCA 2280 

ACACCTGTTT CTCTTTTCCT CAAGGGGTTT GGGGCAGGAG CCTGGGCACT TACTCCCCGT 2340 

TTTTG CTGTT TCTCCTTCTG ACCCTGCTCT TGGGTCTAAT AACCCCATTT ATTTGT 23 96 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 1" 



(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GTGAGTGGGC CCGGGTGCGT TGCAAGTGGC CGGGGGCGTT GCAGACCCGC TCCCTGAAGG 6 0 

CGCTCCGAGG CTCAGAGAAG ACCGGATCGA ACTACAATTC CCATCAGCCG ACTCCCTCGG 12 0 

CCGCCGGAAC CGGGGTGATG G GGGTTGT AG TCCGCTCCGG AGGGGGTGGC CTGGGAGGCG 18 0 

GGAGGGCCTG CGGAGGCGGG TGCGTCCTCG GGGTGACCTT CCCACCGATC CCCACAG 237 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 2" 

(iii) HYPOTHETICAL: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GTGAGACTCC TGGCGGCGTG GCAGCCTCGT GGCCACAGCC AGCACCCCTC CCTCCAGCCC 60 

TTTGGATGAA GCTTAGCGGG CGCTTCCTCA CCGCGCACCT GTGACCCCTC GTTGAGCTCA 12 0 

TTATGGGCTG AAGC TCGG AG AGGGGAATGC TTCCCCCGTG GCTAGGAAAG TAG AAC TTCA 18 0 

GCAATGCCCA CATCTCAGAG GGGCAAGGCC ACCAGCCCAC AGGGTCTGGA ATG AG CAAAG 24 0 

GCGCTGCCAC CCCTCTCTGT GTGTCGTTAT TCCTGAGTCA GTCACCCCAA AAGTCGGTTA 300 

TCGAACGTTT GATTTTCTTT GAAAT AC CAT GAATTTCATT CATTCACTCA TTCATTCAAC 360 

AAACTTTTTT TTTTTTTTTT TTTTTGAGAC AGAGTCTGGC TCTTTCGCCC AGGCTGGAGT 420 

GCAGTGGCGC CATCTCGGCT C AC TGCAAG C TCCGCCTCCC GGGTTCACGC CATTCTCCTG 4 80 

CCTCACCCTC CCGAGTAGCT GGGACTACAG GCGCCCGCCG CCACGCGCGG CTATTTTTTT 54 0 

TTGTATTTTT AG T AG AGACG GGGTTTCACC GTGTTAGCCA GGATGGTCTC GATCTCCTGA 600 

CCTCGTGATC CTCCCGCCTC GGCCTCCCAA AGTGCTGGGA TTACAGGCGT GAGCCACCGT 660 

GCCCAGCCTC ATTCAACAAA CTTTTAGTGT GCATCTACTG TGGAGCAGGC ACTGGGGACA 720 

CAGGAGGAAA CAG CAGGG AG GCTCTTCAGG GAAGGCAGAA ATGTGGGGTT TGCATTGTCT 78 0 

TTGGGACCGG GTT ATT CATC TGTATT C ACT GCAACAACTT TGCAAATGCT TCTTGGGTAC 84 0 
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TGGCTCTGTG CTGGGCCCTG GAAACCCAGA GATGAATCAG CCCCTGGGCT TGAGAGCAAG 900 

AAGGGGCCAA AG AG CT ATT A ATAATGTAAC ATGATGCGTG ACATTCCAGG CTTGCAGCAG 96 0 

AGTGCAGTGG GTCCCCAGGG AAGGAGAAAG TTCCTTCTGC TTCATGGAAG AGGAGATTTG 1020 

TAAATTGGGA GTAGGGTAGG CAAAGTGCGT GTGGAGGGGT G TGG TCAGTA GGGCATTCCA 1080 

AGC CGAGGCG ACAGCCATGC CAAAGGCAGG CAGGCAAGAG ACGATCAGCC TGTTTAGAGG 1140 

GAG ATT C CAC AGCCAGGGCT GCCTGGAGCT TAGCAGGATG GAG C AGAAGA TGGGGCACAA 12 00 

AGGGAGACTA GGATCTGATT CTGAAGAGCT GTTCCATTTG GGGCTTTGCC CTGCAGGCAA 1260 

T AGGGAGG C A TGAATCGGGG TGTTGAGGAG TGAGGAGGTT AAGCAGAGGA GTGGCAGGCT 132 0 

ATGTGCTCTA GAGAGAATGC AGTTGTTCAG CACTTAGGCC AAAGCCTGGC TTGACAGTAG 13 8 0 

G CGCT CAAT A AATACCCGTG GAATGAATGA ATGTAGCAGC TGCTGCAGGA GTGGGGATGG 144 0 

GGGCTGGAAC CAGGGCACTG AAGAGGAGGG GCCGTCCAAG GCTGGATCGA GGCTCTTGCT 1500 

GGGGGCTCTT AATGCTTAGG CTGTGTCCCA GACTTCAGCC ATTTAACTCA GCACACATTC 156 0 

ATGGAGCTCC GACTGTGTGC CGGCTGCTGG GAACAGAACA AGGGCAAGGC AGACAAGTCC 162 0 

C CAC AG AC AT TAGAAAGCAA TCACAAGTGA GGGGGAAGGC AGCGGGGGAA GGCTGAGAGG 16 8 0 

TGCTGACCCT CCACCACCTC CCTACCTGGC AG 1712 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 3" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTGAGAGGCC TGCCCTGGCT CCTGCGCCCT CTGCCGAGGC AG CTTCATTG AGGCCCTCCC 6 0 

CTGCGCCCCC TGCCCTCCAG CCCTACTGCC CAGCCCCAGG TGTCGAGCCT CCTTCCCCAG 120 
CCCCCTCTGG CCTGGGCCCC ACTGATGCAC TCTGCCTGCT TCTGAGCAAG 170 
(2) INFORMATION FOR SEQ ID NO : 14 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 4" 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GTGAGCACCT CCACCCCCTC TCCAGCCAGC CTATGCAGGG CTTCAGCCTG GCCTCATCAT 6 0 

TGAAAGGGCC CAG CAT AT C C GAGAAGGGCA GACAGCATCA YGGTCGCTCA TATCCCTGGT 120 

GAAGAAACTG AGGCCCACAG GGAGGGGAAG AGTCACTTGT CCGGTGACCT AGGAGCAGGC 18 0 

CTCCTGGTCA ACAGCCCCAC AGACCAATGG CTGCACCTCA GAAGAGGACT GAATAGCGGG 24 0 

TGTTGCCCCC GAGTGCTCAG AG TCC CTAGG GAAGCTCACA CCTGCGCAAC CTTGTCCAGA 3 00 

GTCCCCTGTG TATCCTGCCG GCACCTCCTC CAGGGAGCCT CTGAGTCTTG CCTAGTTGAG 3 60 

CAGCAGCCCC CATCCCTGTC CTCCACAG 3 88 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 5" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GTGAGTGCAC TTGGGTGGCT GGGAGGGCTG GGGTGGCCTC TGAGGTTTGG GGAAGAGAGA 60 

GGG CATGTG A GAGCAAGACA CATGGGTTCT GGGTGAAGAT GGAGGTAAGC GGGTGATATG 120 

GAAATGGGGA TTGGTCTGGG GCTAGGGAAT GGGGCTCATG GGCCCTGCAG TGAGGAGTAA 180 

TAACCAAGTG AGGACTGGGT TAACTCAGGG ACAGGGGCAG GATTCCTGGG GCTAATACTG 240 



WO 98/46749 PCT/US98/07283 

90 

GCATGTGGCA GCAGGTTGAA GTTGAAGGAT AAGGGGATGG GTTTGGGATT CAGATAGTCT 300 

TGGGTTTGAA TCTGCTTCAC CGCTTACCAG CTAGGGGTGT TGGACAAGGC TTGTCACCTC 360 

TCTGTGCTAG CTTCCCCACC GATGTGATTG GTACAGCTCC CTGCTCAGAT TGTAATGAGC 42 0 

ATGCAATGAG AGAAGGCTAC TGGCACATAG TACGTGCTCA ACAAAAATGA CACATGGGGA 480 

AAGTGAGAGA AGTGCAGGGC TGCTCTGGGG CCCTGTACAA GATTCCCATT TGTCAGTGAA 54 0 

GGGAGGAGCG GAAGAGG CTG GGAGTGGGTC TGAGAAGTAC ACAATGGGAA GTGGGACAAG 600 

AGTTGGAAGC CCCGTGGGAG CCGGCGGGAC CAGGTGCCTC TCTTCTGCAG CTGAAGTTCC 66 0 

TCCGCAG 66 7 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TO POLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 6" 

(iii) HYPOTHETICAL: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GTGAGCCTG C TGTTGGGCTC GAGGCCCCAC CTCAAAGCAT CTTGGATAGA GTCCTGAGCC 6 0 

TGAAGCCCTG AG AG AGG C C A GGGGAGGTGG AGGAGACCTG GTCTCAGCCC TGACCCCCAG 120 

AGAAGACACT GAGGGGCCCC AGCCTCCAGG CCAATGGTAT GGGGAGGGAT CCAGACACCT 18 0 

CAGGCAAGCC AGGCAGGCCC AACACTTTCC TGTCCTTCTG CAG 223 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

<A) DESCRIPTION: /desc = "intron 7" 

(iii) HYPOTHETICAL: NO 
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(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GTAAGCCCTG AGACCCTCCT GGGGAGGTGG GAGATGCAGA GGAAGCTAGA GCCACCTCCC 60 

CTGGGAAGCT GTTCCATCTG TTCCCAGCCA GAGCCCACCC C TAG TAG C CT TGTGCAAACA 120 

GGAAGATCAT GAAGGGAAGT TGGTAGGATT AAAGTCATCC CTGCTGTTGT TTGGGCCTCA 180 

GTTTCCACCT CTATAAAATG GGGAGGCGAC AGAAGTTCCA TGCATGCAAA CTTTGGATCG 24 0 

AAGACCTCTG AATTGGAATA CTAGTTTCAC AAC AT CCCRG CTGTGTGGCC TGAGACAAAC 3 00 

C AC TT AG CCA CTGCACCCCT CTGAACCTCA ATGTGTCATT TGTAAAGCAA TGGTAATGAG 3 60 

ATAATCCATC TAAGGTGCTT CGCTCATCAC CCGACCCATG CACGCGCTTC TGGTAGCTAT 42 0 

GCATATTTCC ATCATGAATT CCCTTCGCCT GCAGCCTCAG CTTAGGCTGG AGGAAGATCA 4 80 

CCTTTTTTTG TTTTGGGGTG AGGGGGTTGT TGTTATTTTG AGTCAGGATC TCACTCTGTC 54 0 

ACCTAGGCTG CACTGCAGTG CTATCACAAC TCAACTGCAG CTTCGACCTT CTGGGCTCAA 6 00 

GTGAGCCACC TCAGTCTCCC GAGTAGCTGG GACTATAGGT GCAGGCTGCC ATGCCCGGCT 660 

AATTTTTTTA TTTTTGTAGA GATGGTGATT CACCATGTTG CCCAGGCTGG TCTCGAACTC 72 0 

CTGGGCTCAA G CAAT ATG CC CGCTTCGGCC TTCCAAAATG TTGGGATTAC AGGCGTGAGC 78 0 

CACCATGCCG AGCTGAGGAT CACTTGTTTT AACTGCTGGG AATCTCCCTT CGTTGGGC CT 84 0 

GGCTGTCGGG AAACCTGGGT CACAAGCATG ACCCTTCCPC GTCCCCCCTC ACCCCAG 8 97 
(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 8" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GTTTGG CTTG GATACAGCCC CCAACCACCA TCTTTGGGGA AGAATGGGGC TCACATTGAC 6 0 

TCCAAGGT C A TAGGGTCACA GTGGGTCAGG GACAC AG CTG GGCCAGGCCC CAAGTGTCCT 120 

GCTCCCACAT GGGGCTTGGG CAAGAGGGTG GGGCCCTGGG ACTGCCCTGC CTGCTCACAC 18 0 
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CCCTGCCTCY GGCTCCCAG 19 9 

(2) I NFORMAT I ON FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " intron 9" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTGAGTGTCA GGGTGGAGAA GACGGCAGGG CAGGGGGTAC AAAGGGGAGA GGACGGGAGA 60 
GGGGAGTTGG AGACCAGTAT GAGCTGCAGC CGTTTCCCTC CCAG 104 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 10(5') " 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GTAAGCACAC CAGGGCTGGG TGATCCGATG TTTTAGGAAG C AG TTTGGGA TCCGAGGGCT 60 

TGAAAGAGCA CGGAGGTGAT TTTCTGGTAG TCCAAGTGGC CTGGTAATGC AACCACTGGC 120 

CAAGCAGCAG GGAGCACTTG GGCCCTGGAG GCGTGCAAGG CCAGGGCTTG CACTG TGAGC 180 

TCCCTGAAAG CAAAAATCAT GTCCAGCTGA CCTCTGTGTC CCCAGCATCC AGCCTTTGCT 24 0 

GCTCAGAGAA TGTTACATGG AGGTTCCTGC ACCAGGTGAG GGACTGAGCA AGATCTTAGT 3 00 

TTTGGGGTTG GTTTTAG CC A TGGTGCTGTA TCTTTAAATG AAATCTTCCA AAGAGACAAT 3 60 

ACATAACG C A GGTGAAAGAN GANCTGGTCT CATCCAAGTC AGGACANNGA GCTGATCTAC 420 
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AGCTTCCAAT CCCACTCAGA AACCCTCTGC CCCCAANGGG G 



461 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 10(3*) *' 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GCACTTTTGG GAGGCCAAGG GCAAGAGAAT CGCTTGAGCT CAGGAGTTCA AAACCAGCCT 6 0 

TGGGCAACAC AGTGAGACTT TG TCTGT AC A CACACACACA CACACAAATT TTTAATGAAG 12 0 

AAAATAGAGG CCGGGTATGG TGGCTC ACG C CTGTAATCCC AG CACTTTGG GAGGCTGAGG 18 0 

CAGGTGGATC ACTCGAGGTC AGAAGTTCGA GACCAGCCTG GCCAACATGA TGAAACCTGG 24 0 

CTCTACTAAA AATACAAAAA TTATCTGGGC ATGGTGGTGG CGGGCGCCTA TAGTCC C AG C 3 00 

TACTCAGGAG GCTGAAGCAG GAGGATTTCT TGAACCCAGG AGGTGGAGGT TG CAGTG AG C 360 

TGAGATCAGG GCCACTGCAC TCCAGCCTGG G C AAC AG AG C G AG ACTC CAT CTAAAAAAAA 42 0 

AAAAAAAAAG TACTCTATGG GTGTCCTGAG ATGCCCTGGA GCAGAGACCT GGCTCCAGGG 48 0 

ACCATGCTGA CTTCAGCCTC TACCACAG 508 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "intron 11" 

(iii) HYPOTHETICAL : NO 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
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GTCCGGGCAA TGGGTGGGTG GGGGGCTGGA TTAGGAGGTG GTTTATCTTN GGGAAGGACC 6 0 

GCTGCAATGG AGGGACGGCC ATCCTGTTCT GGCCAGCCCA ACCTAGCTGT CTGCAGCCTT 120 

GCTGGCGCCC CNTACTGGCC AAGCTTAACT GCAGGGGAGA GAACTGGGTA GGGAGGTACC 18 0 

CGCCCAACCA AGTAGCCCAG GCACTGGTTC TGGGCCGCCT CAATGTGCNT CAGTTTCCCC 24 0 

ATCTGTAAAA AAAAAAATGG GTTGAACTGT CATCCCTCAG GGCCCATCTA ACTGTAAAAT 3 00 

TCT CAGTTG A AGGAGAG CTA AGGTTTTGAC CAAAAACAAG GTCATGGGCT ATTTCCTCAA 3 60 

GGGGCAATGG AGTG GAGAAT CCAGAGAGAA TGAAGCTGGC AGGGCAGACA GGCTGAGAGC 420 

ACTGTG GAAA GGGCAGGCTG TGGAATCTGG AATCC C AT C A TGTTAGACTC AGAGGCCCTG 480 

AGAG AC AT C C TTATCCAGCA GCCTCATTTA CAGACCAGGA AACTGAGGCC CAGAAAGAAG 54 0 

GGGCCAGTTA TGGTGACAGA GGGGTTGGGT C AG AG C C C AG ACTGGATGGG CAGAGGGCAG 6 00 

TGGAGCTGGG TCCAGATTTA GACCCAGCAT TTTCTAAGAG CTCCTGTTCC CGGGTGTTTT 6 60 

AG 662 
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WHAT IS CLAIMED IS : 
II. An isolated nucleic acid encoding a nitrobenzylmercaptopurineriboside (NBMPR) 

2 insensitive, equilibrativc nucleoside transport protein (/ENTP) or active fragment thereof, 

3 wherein the /'ENTP: 

4. (a) functions as an equilibrative nucleoside transport protein; 

5 (b) is insensitive to NBMPR; and 

6 (c) is a protein containing approximately 450 amino acid residues. 

1 2. The isolated nucleic acid of Claim 1 having a nucleotide sequence with at least 80% 

2 identity with the coding sequence of SEQ ID NO: 1 . 

1 3. The isolated nucleic acid of Claim 2 having the nucleotide sequence of nucleotides 

2 23 8- 1 605 of SEQ ID NO: I . 

1 4. The isolated nucleic acid of Claim .1 wherein the /ENTP has an amino acid sequence 

2 selected from the group consisting of SEQ ID NO:2 and SEQ ID NO:2 comprising a 

3 conservative substitution thereof. 

1 5. The isolated nucleic acid of Claim 1 which is a DNA. 

1 6. A DNA construct comprising the DNA of Claim 5 that is operative ly linked to an 

2 expression control sequence. 

1 7. A cell which is transfected or transduced with the DNA construct of Claim 6. 

1 8. The transfected or transduced cell of Claim 7 that is a prokaryotic cell. 

1 9. The transfected or transduced cell of Claim 7 that is a eukaryotic cell. 

1 10. The eukaryotic cell of Claim 9 that is a COS cell. 



1 



11. 



The eukaryotic cell of Claim 9 that is a human T-cell leukemia CEM cell. 



WO 98/46749 PCT/US98/07283 

96 . 

1 12. An isolated NBMPR insensitive, equilibrative nucleoside transport protein (/ENTP) 

2 or active fragment thereof, wherein the /ENTP: 

3 (a) functions as an equilibrative nucleoside transport protein; 

4 (b) is insensitive to NBMPR; and 

5 . (c) is a protein containing approximately 450 amino acid residues. 

1 13. The isolated /ENTP of Claim 12 having an amino acid sequence selected from the 

2 group consisting of SEQ ID NO:2 and SEQ ID NO:2 comprising a conservative substitution 

3 thereof. 

1 14. The isolated /"ENTP of Claim 12 having an amino acid sequence encoded by a 

2 nucleotide sequence with at least 80% identity with the coding sequence of SEQ ID NO: 1 . 

1 15. A fusion protein comprising a NBMPR insensitive, equilibrative nucleoside transport 

2 protein (/ENTP), wherein the /ENTP: 

3 (a) functions as an equilibrative nucleoside transport protein; 

4 (b) is insensitive to NBMPR; and 

5 (c) is a protein containing approximately 450 amino acid residues. 

1 16. The fusion protein of Claim 1 5 wherein the nucleoside transport protein has an 

2 amino acid sequence selected from the group consisting of SEQ ID NO:2 and SEQ ID NO:2 

3 comprising a conservative substitution thereof. 

1 1 7. An antibody to an NBMPR insensitive, equilibrative nucleoside transport protein 

2 (/ENTP) wherein the /ENTP: 

3 (a) functions as an equilibrative nucleoside transport protein; 

4 (b) is insensitive to NBMPR; and 

5 (c) is a protein containing approximately 450 amino acid residues. 
1 18. The antibody of Claim 1 7 that is a monoclonal antibody. 

I 1 9. The antibody of Claim 1 8 that is a chimeric antibody. 



1 



20. 



An immortal cell line that produces a monoclonal antibody according to Claim 18. 
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1 21. The transfected or transduced cell of Claim 7 in which all detectable nucleoside 

2 transport activity is performed by the /ENTP encoded by the DNA. 

1 22. The transfected or transduced cell of Claim 21 which is a human cell. 

1 23. A nucleoside transport deficient subline of a human T-cell leukemia cell line CEM, 

2 transfected with an Epstein-Barr Nuclear Antigen 1 expression cassette, wherein the cell line 

3 is capable of supporting the episomal replication of the Epstein-Barr virus-based mammalian 

4 expression vector pDR2; and wherein the cell line has a stable transfection frequency with 

5 pDR2 of approximately 1 0 2 . 

1 24. A knockout mouse comprising a first and a second allele which naturally encode and 

2 express the /ENTP of Claim 12; 

3 wherein the first allele and the second allele each contain a defect which prevents the 

4 knockout mouse from expressing the /ENTP; and 

5 wherein the knockout mouse has a functional phenotype of being particularly 

6 susceptible to drugs such as NBMPR. 

1 25. A method of isolating a cDNA encoding an NBMPR insensitive, equilibrative 

2 nucleoside transport protein {iENTP cDNA) comprising: 

3 (a) transfecting a nucleoside transport protein deficient cell with an expression 

4 vector from an expression vector library; wherein a cDNA library encoding an iENTP is 

5 represented in the expression vector library; and wherein the iENTP cDNA is expressed in a 

6 transfected cell; 

7 (b) selecting for an expression vector containing the iENTP cDNA by culturing 

8 the transfected cell under conditions in which the cell growth is dependent on the expression 

9 of the /ENTP and its corresponding transport activity; and wherein the selected expression 

1 0 vector contains the iENTP cDNA; 

1 1 (c) extracting the selected expression vector from the transfected cell; 

12 (d) transfecting a host cell with the selected expression vector; and 

1 3 (e) isolating the cDNA encoding the NBMPR insensitive, equilibrative, 

14 nucleoside transport protein. 
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1 26. A method of making an /ENTP by culturing the transfected or transduced ceil of 

2 Claim 7, wherein the /ENTP is expressed. 

1 27. The method of Claim 26 wherein the /ENTP has an amino acid sequence selected 

2 from the group consisting of SEQ ID NO:2, and SEQ ID NO:2 comprising a conservative 

3 substitution thereof. 

1 28. A method of obtaining a purified NBMPR insensitive, equilibrativc nucleoside 

2 transport protein (/ENTP) from a cell that expresses the /ENTP comprising lysing the cell, 

3 and purifying the NBMPR insensitive, equilibrative nucleoside transport protein. 

1 29. The method of Claim 28, wherein the /ENTP has an amino acid sequence selected 

2 from the group consisting of SEQ ID NO:2, and SEQ ID NO:2 comprising a conservative 

3 substitution thereof. 

1 30. A method of identifying a ligand of an equilibrative, NBMPR insensitive nucleoside 

2 transport protein (/ENTP) comprising: 

3 (a) contacting a potential ligand with the /ENTP of Claim 12 under 

4 physiological conditions; and 

5 (b) detecting whether the potential ligand binds to the /ENTP wherein a potential 

6 ligand is selected as a ligand if it binds to the /ENTP. 

1 31. The method of identifying a ligand of an /ENTP of Claim 30, wherein said detecting 

2 includes determining the dissociation constant between the potential ligand and the /ENTP, 

3 and wherein a potential ligand is selected as a ligand when the dissociation constant is less 

4 thanlO- 5 M. 

1 32. A method of identifying a permeant of an NBMPR insensitive, equilibrative 

2 nucleoside transport protein (/ENTP) comprising: 

3 (a) contacting a nucleoside or nucleoside analog with the transfected or 

4 transduced cell of Claim 21; 

5 (b) evaluating whether the nucleoside transport of the nucleoside or nucleoside 

6 analog by the transfected or transduced cell follows a facilitated diffusion process; wherein 

7 the nucleoside or nucleoside analog is identified as a permeant when the transport of the 
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8 nucleoside or nucleoside analog into the transfected or transduced cell is determined to 

9 follow a facilitated diffusion process. 

1 33. The method of Claim 32 wherein the nucleoside or nucleoside analog is selected 

2 . from the group consisting of an antiviral nucleoside analog and an antitumor nucleoside 

3 analog. 



1 34. A method of selecting a drug that inhibits an NBMPR insensitive, equilibrative 

2 nucleoside transport protein (/EN TP) comprising: 

3 (a) contacting a potential drug with the transfected or transduced cell of Claim 

4 21; and 

5 (b) evaluating a nucleoside transport activity of the transfected or transduced 

6 cell; wherein a potential drug is selected as a drug when a decrease in the nucleoside 

7 transport activity is determined by said evaluating, relative to the determination in the 

8 absence of the potential drug. 

1 35. The method of Claim 34 wherein the nucleoside transport activity of the transfected 

2 or transduced cell is selected from the group consisting of the trans-stimulation of a 

3 permeant, direct transport of a permeant, countertransport of a permeant, and the toxicity of a 

4 toxic nucleoside analog which is a permeant of the zENTP. 
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